Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Arst6g03235 ATGGGTGTCCACGGTCTCTGGGAACTACTCGCCCCCGTCGGCCGCCGCGTCTCCGTCGAGACCCTCGCCGGAAAAACCCTTGCAGTCGATGCGAGCATATGGATGGTGCAATTTATGAAGGCGATGCGCGATGAGAAGGGCGAAATGGTTCGAAACGCTCACTTGCTGGGCTTCTTTCGTCGAATTTGCAAGCTCTTGTTCCTCAGGACAAAGCCAGTCTTCGTCTTCGACGGCGGAACCCCCGCCCTAAAGCGCCGCACCGTCATCGCGCGCCGCAGGCAGCGTGAGAACGCCCAGGCCAAAGTCCGCAAGACTGCAGAGAAATTACTTCTCAATCACTTAAAGGCATTGAGGTTGAAAGAAGTTGCTGATGACATTAAGAACCAGAGGCTGCAGCAGAAGAGTGAAGCCAAGGGATGCAAAAAACCTGATCGGATGGACTTTGTCGATAGTGATCCAGGAAAAAGTAACATGAAAGAGATAAATGAAATGTCTTCAGCTAAACTTGCAGCTACAGAGGCAGGAAATTTCTTGCAAGAAGTTGTGTCAAGAAGTCGCAACCAGGAGGAACTTGATGAAATGTTGGCAGCATCTATATCTGCAGAGGAGAATGGAATACTAGCCGGGAAAGAAGTGCCATCTACTGTACCTAATACTTCAGAGGAGAAATTTGACACAGATGGAGAAATGATACTGCCTTGTGACAATGAAGTAGATTTAGCTGTTTTAGCTGCTTTACCACAGTCAATGCAACTTGATATTCTTGCACAGAATGTGTGTGCTTGTGCACTCTATCTTGAAAATATCATCTTCCTTTTACTTGCTTGTGTCATTTTTTCTTTTTATGTAAATGATAGGGGCAAGGGTAAGGGGATTCTGTTTAGAGAAAGTGACTTGGGTGGTTGTAGCTCAAAATGTGACAATGTGATATCAACAAATGACAATCAAGACAAAATTGATGAAATGTTAGCAGCCTCTATTGCTGCGGAGGGCATTGCAAAGTCTATGAGTAATGCATCAACTTTTTTTGAGGCTTCAGCCATTGAGGAAGAGGATGGTGACTATGATGAAGATGAAGAGATGATACTGCCTGCAATGCATGGTGAAGTTGATCCAGCTGTTCTGGCCTCTTTACCTCCATCAATGCAACTGGATCTCCTAGTTCAGATAAGAGAGCGATTAATTGCAGAGAATCGACAAAAGTATCAAAAAGTAAAGAAGGATCCTGCAAAATTTTCTGAGCTACAAATAGAAGCTTACCTTAAAACAGTTGCTTTTCGACGAGAGATAGATGAAGTGCAAAAAGCTGCTGCTGGAAGAGGAGTAGGGGGTATCCAAACTTCACGGATTGCATCTGAAGCCAACAGGGAGTACATATTCTCATCATCTTTTACTGGTGACAAACAAGAACTTGCATCAAGCAGAGCAGAGAAAAATGATGATGCTCATCGTAAGGCTCAAGGAACACATCCTGTGGAGAATCTTGCTAATATTATTGCATCAGCTGGTTCTAATACTACAAGTGGATTGGTTTGCAATGAACCTAGTGAATCTGTTGATGAAAGAATTCAGACCTTCCTAGATGAGAGGGGTCAATTTCGAGTTAGCAGATCAAGAGCTATGGGGATGCGTATGACCCGAGATTTGCAAAGAAATTTGGACTTGATGAAGGAGATTGAGCTGGAAAGAACACATATCAACAAGGCTTCAAATATTGATGCAATTTTGAGTGCAGAGAATAATGGTCCATCAAAAAGTTCTGGGACCAATTCTGTTGGTAAATTAAAAACTATGAATGTTGATCTAGTTGGCGAGTGTGTGCAAAATGAGCAATCTGTCTTTGACAAAGATACTTCTATAGAGGTATCCTTTGAATACGATAGCAAGAATGCACTCATTGATGCTGAAGATGAAATATTTGCTAATTTAGTAGGAGGAAATTCGGGGACAGTATTTCATGCTGATGGTACTCCAGCAAAAGAACATCCTTCTAATTCAGATTCAGATTGTGACTGGGAGGAAGGAATTGTTGAAGGGAAGAATACCTTCATTCCTGGAAATAATAAAGTGGAATGGAATTCTTCTGTTGCTGAAGGGGATAATAATGATGAGAGTGAAGTAGAATGGGAGGAAGGAGATTGTGATGGTGATAAAAGTACCATATGTTGCCCATCTGAGACTGGGAAAAAGCCAACTCGAGGTCAATTGGAGGAGGAGTCTAATTTGCAGGAAGCAATTAGGAGAAGTCTTGAGACTATAGGAGATGGAGAGCTTAAGCACCTATCATCTGTAGATGAGCATTCAAATGCTGATGAGAAAAAATTGGATTCCCATGGTGATTATTTGGATGTTTCTGGTGCAATGAATTTAAATGATGAAGATGCATTTCTGAAGATCAAAAATAGTATGGCTGTTTCATCTTCTCCAAGGGAAGATGGTTCTAAACAAAATATATTTCATGGAAATGTGGATGCCGATGGTTATGTCAACTCCCAAACTTCTGATTTTCCTAGGGGTCAGTCAAAGTCATATGTAGCATCTATTTCTAGTAACCTGGATATATTGATTGAAAAGCCTAATGTACTGAATAGATGCTCTCATTCTGAATATTCGACTTCAGATGCAAATATGATGAAGGACAATGACCATATGGCTGCAGAACAATTGTTGGATAAACATTGTGATGATACTAAGGTGTCTTCCGATGGCAAAAATGTTTCCAAGGATAATCCACTTGGTTCCACTGAATCATCCTTGAAGGGATCAACAGAAAATGTTGATATTGGGCCAAAGTTAGCTGCAGTGGACAATGATGGAAGCTTCAGAGGAGAAAGAAATATTGATCTTGTGAAAAATGCAGTTAACACCTCAGGAGATTTTCCAGCACATGTAGACGAGGTTAGATTGGAGGAGGAAATACGAATACTTGGTCAAGAATATATAAACCTTGAAAATGAGCAGAAGAAACTTGAAAGAAATGCAGAATCTGTCAATAGTGAATTGTTCACAGAATGTCAGGAATTACTGCAAATGTTTGGTTTGCCATATATTATTGCCCCAATGGAAGCTGAGGCACAGTGTGCTTTCTTGGAAACTGCAAAACTGGTTGATGGTGTTATAACTGATGATTCTGACGTCCTTCTATTTGGAGCTCGTAATGTTTACAAGAATATATTTGATGATCGCAAATATGTAGAGACATACTTCATGGAGGACATTGAGAAGGAACTTGGATTGAGCAGGGAAAAATTAGTACGCATGGCACTACTACTTGGGAGTGATTATACCGAAGGTGTAAGTGGCATTGGGATTGTCAATGCTATTGAGGTTGTGAATGCGTTCCCTGAGAAAGATGGCCTCTTGAAATTCCGCCAGTGGGTTGAATCACCAGATCCCACCATTCTTGGATGGTTGAATACAAAAGGTGGATCAACTACAAGAAAGAAAGGATCAAAAGAGACTTCGTCGGATCAAATTAATAGTCATATTAAGGAACAAGAGGATTCGCTGGATTGTGACCAAGAAATCAAGCAAACCTTTTTTGAGAAACATAGAAATGTTAGCAAGAATTGGCATATTCCATCTTCTTTTCCAAGTGAAACAGTTATATCTGCTTATTATTCTCCACAAGTTGACAAGTCAACCGAGCCTTTCACCTGGGGAAAGCCAGATCATCTTGTTCTTCGAAAATTGTGCTGGGAGAAATTTGGGTGGACTAGCCAGAAAGCAGATGAATTACTATTACCTGTTTTAAAGGAATATAACAAACATGAGACTCAACTGCGTTTGGAAGCGTTTTATAGTTTCAATGAACGATTTGCAAAAATTCGTAGCAAGAGAATTAAGAAAGCTGTAAAAGGGATTACTGGTAAGCAGCCTTCAGAATTGAAGGATGATTTCAACAGTGGCAAGAGTAGGAGAGGAAATCATCTAGAATCTGAGGACAAAAATTTTGAGAATCTAAAGGCGACAAAGGAAAGTCTTGAAAGTTTGAAGAAACCTAAAGTGAAAGAATCAAGGAAAAGGAAGAATGATGGGGACATTCTTGGGAAGGCAATGTCAAAGAGAAGGACAATCATTGATGGTCCTTCTTCAGCTTCTGGTATGTCTGCAGTGGAGAATTTACAGCCGGGTACCGAGTCTGAAAAAGACCAAAGTGATAGTAATCCATTGATTTCAAATAGAAGTGGGAGGGGCAGAGGAAGAGGCAGAAGTTTGGCAGTAAAACATGGAAGGAAAAAGGAAAGTCTCATTTATCAATCATCTGGAACGTCATCTTCCAGTAGTGACACTGATGATCATGTTGATATGTCAAAAGTTCCTCAAGAGGTTCGAAGGTCCTCACGTTCCAGAAAACCTGTCAATTATTCACTTGAGAACCCAGAAGATGAAGAACTCAATGAGTCATTTGATACGAGGAATCAATCATCCTTGTGTGAAGATCCATTAGAAGAAAATTTATCTGATATTCCTGGTGCATGTGGGGATTCTGCAACTGGTTTAAGCAGAGGTAAAGAGAGTGATATGATAAGTAGTGCTCCAACCAGGAACTTCCCTAGAGACGATCTTGGGTCGGAAGGTCAATTTTTCACGGATGCTGATGAAACTACTCATCCAGATCCAGGAATTGGTGATGGTGATATTACTGTTAATGCTGACTCTTGTGATGACTACCTTAAACTAGGAGGTGGTTTTTGTTTAGATGATAGTGATGAACCTAGCAATCAGAATGCAGTTGATGGTGTCGATACTGCTGATACTGAAGGATTTCTGCACTGTTCTGTTATGATGGATGAGACTGACCATCATAAAAATGGTTCTGAGATATTACTTTCAGGCACCGATAATGCTCGCAGCGAGATGCAAGAGGGACGCAATGCCTACAATGTTGACAACGAGCCAAATGATAACCTTCCAAATGTTAGTGCCAATGATCAAAATCAAATGGGGGTCTCTGTACCTGAGAATGTTAATCATAATAATGGAAACTACAATGGGGCATTTAGTGCAATGCCATTTTTGAGGAAAAGAAGGAAGAAGTAG 5043 0.4073 MGVHGLWELLAPVGRRVSVETLAGKTLAVDASIWMVQFMKAMRDEKGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGGTPALKRRTVIARRRQRENAQAKVRKTAEKLLLNHLKALRLKEVADDIKNQRLQQKSEAKGCKKPDRMDFVDSDPGKSNMKEINEMSSAKLAATEAGNFLQEVVSRSRNQEELDEMLAASISAEENGILAGKEVPSTVPNTSEEKFDTDGEMILPCDNEVDLAVLAALPQSMQLDILAQNVCACALYLENIIFLLLACVIFSFYVNDRGKGKGILFRESDLGGCSSKCDNVISTNDNQDKIDEMLAASIAAEGIAKSMSNASTFFEASAIEEEDGDYDEDEEMILPAMHGEVDPAVLASLPPSMQLDLLVQIRERLIAENRQKYQKVKKDPAKFSELQIEAYLKTVAFRREIDEVQKAAAGRGVGGIQTSRIASEANREYIFSSSFTGDKQELASSRAEKNDDAHRKAQGTHPVENLANIIASAGSNTTSGLVCNEPSESVDERIQTFLDERGQFRVSRSRAMGMRMTRDLQRNLDLMKEIELERTHINKASNIDAILSAENNGPSKSSGTNSVGKLKTMNVDLVGECVQNEQSVFDKDTSIEVSFEYDSKNALIDAEDEIFANLVGGNSGTVFHADGTPAKEHPSNSDSDCDWEEGIVEGKNTFIPGNNKVEWNSSVAEGDNNDESEVEWEEGDCDGDKSTICCPSETGKKPTRGQLEEESNLQEAIRRSLETIGDGELKHLSSVDEHSNADEKKLDSHGDYLDVSGAMNLNDEDAFLKIKNSMAVSSSPREDGSKQNIFHGNVDADGYVNSQTSDFPRGQSKSYVASISSNLDILIEKPNVLNRCSHSEYSTSDANMMKDNDHMAAEQLLDKHCDDTKVSSDGKNVSKDNPLGSTESSLKGSTENVDIGPKLAAVDNDGSFRGERNIDLVKNAVNTSGDFPAHVDEVRLEEEIRILGQEYINLENEQKKLERNAESVNSELFTECQELLQMFGLPYIIAPMEAEAQCAFLETAKLVDGVITDDSDVLLFGARNVYKNIFDDRKYVETYFMEDIEKELGLSREKLVRMALLLGSDYTEGVSGIGIVNAIEVVNAFPEKDGLLKFRQWVESPDPTILGWLNTKGGSTTRKKGSKETSSDQINSHIKEQEDSLDCDQEIKQTFFEKHRNVSKNWHIPSSFPSETVISAYYSPQVDKSTEPFTWGKPDHLVLRKLCWEKFGWTSQKADELLLPVLKEYNKHETQLRLEAFYSFNERFAKIRSKRIKKAVKGITGKQPSELKDDFNSGKSRRGNHLESEDKNFENLKATKESLESLKKPKVKESRKRKNDGDILGKAMSKRRTIIDGPSSASGMSAVENLQPGTESEKDQSDSNPLISNRSGRGRGRGRSLAVKHGRKKESLIYQSSGTSSSSSDTDDHVDMSKVPQEVRRSSRSRKPVNYSLENPEDEELNESFDTRNQSSLCEDPLEENLSDIPGACGDSATGLSRGKESDMISSAPTRNFPRDDLGSEGQFFTDADETTHPDPGIGDGDITVNADSCDDYLKLGGGFCLDDSDEPSNQNAVDGVDTADTEGFLHCSVMMDETDHHKNGSEILLSGTDNARSEMQEGRNAYNVDNEPNDNLPNVSANDQNQMGVSVPENVNHNNGNYNGAFSAMPFLRKRRKK 1680
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Arst6g03235 1680 ProSitePatterns XPG protein signature 2. 1013 1027 IPR019974 GO:0016788
Arst6g03235 1680 Pfam XPG I-region 1013 1094 IPR006086 GO:0004518
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1417 1431 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1146 1169 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 895 924 - -
Arst6g03235 1680 SUPERFAMILY PIN domain-like 2 1094 IPR029060 -
Arst6g03235 1680 Coils Coil 966 1000 - -
Arst6g03235 1680 Pfam Ubiquitin binding region 369 393 IPR025527 -
Arst6g03235 1680 Pfam Ubiquitin binding region 235 257 IPR025527 -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1285 1524 - -
Arst6g03235 1680 CDD PIN_XPG_RAD2 996 1080 - -
Arst6g03235 1680 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED 1 257 - -
Arst6g03235 1680 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED 1 257 - -
Arst6g03235 1680 SUPERFAMILY 5' to 3' exonuclease, C-terminal subdomain 1079 1275 IPR036279 -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 681 740 - -
Arst6g03235 1680 ProSitePatterns XPG protein signature 1. 70 84 IPR019974 GO:0016788
Arst6g03235 1680 Coils Coil 1311 1331 - -
Arst6g03235 1680 CDD PIN_XPG_RAD2 2 92 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 904 924 - -
Arst6g03235 1680 Pfam XPG N-terminal domain 1 97 IPR006085 GO:0004518
Arst6g03235 1680 PRINTS Xeroderma pigmentosum group G/yeast RAD superfamily signature 72 91 IPR006084 -
Arst6g03235 1680 PRINTS Xeroderma pigmentosum group G/yeast RAD superfamily signature 24 38 IPR006084 -
Arst6g03235 1680 SMART HhH_4 1081 1114 IPR008918 GO:0003677|GO:0003824
Arst6g03235 1680 Gene3D - 1 144 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1297 1354 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 686 700 - -
Arst6g03235 1680 FunFam DNA repair protein UVH3 1079 1136 - -
Arst6g03235 1680 CDD H3TH_XPG 1082 1213 - -
Arst6g03235 1680 Gene3D - 1079 1133 - -
Arst6g03235 1680 FunFam DNA repair protein UVH3 959 1078 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1432 1459 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 133 155 - -
Arst6g03235 1680 SMART xpgineu 1010 1079 IPR006086 GO:0004518
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1139 1169 - -
Arst6g03235 1680 MobiDBLite consensus disorder prediction 1360 1395 - -
Arst6g03235 1680 PRINTS Xeroderma pigmentosum group G protein signature 96 118 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Arst6g03235 1680 PRINTS Xeroderma pigmentosum group G protein signature 2 19 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Arst6g03235 1680 PRINTS Xeroderma pigmentosum group G protein signature 54 77 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Arst6g03235 1680 SMART xpgn3 1 98 IPR006085 GO:0004518
Arst6g03235 1680 Gene3D - 962 1078 - -
Arst6g03235 1680 FunFam DNA repair protein UVH3 1 146 - -
       

Duplication type information


Select Gene Chromosome Start End Duplicated_type
Arst6g03235 Arst-Chr6 99362756 99374331 Dispersed
       

Functional genes information


Select Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Arst6g03235 1004 1111 Core DNA Replication Machinery Family AT5G26680 35.135 2.20e-12 69.3
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst6g03235 K10846 - gmx:100820295 1941.39