Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Arst5g00922 ATGGAAGGGTCTCAGATAGAAAAGCAAGGTGAAGAAAAATGGACTGTCTGTCATTTGTCGACTAATTCGGATGTTCAATGCAGCAGAGATTCAGGTTGTGGCTTTCAGGGGGAAGACCAGAAGGAAGACTCCGCTCTCGATGATCTTGATGATGATTTGATAAATGAACCTTGCTTAACATCTGAAAATTCTCTTTTGGTTGTGGATACGATTGAAAGCGAATCACCAAACAACAGGGAAGGAGATTTATCATTTTCTGAACCCAAGTGGCTAGAGGGGGATGAACCTGTGGCATTATGGGTCAAGTGGAGAGGAAAGTGGCAGACTGGCATCAGATGTGCGAGAGCTGACCGGCCATTATCAACTTTGAGAGCAAAACCAACTCATGGACGGAAGAAATATTTTGTCATATTTTTTCCACACACAAGGAATTATTCTTGGGCAGATATGCTGCTTGTCCGATCAATCGATGAGTTTCCCCATCCTATTGCATATAAGACTCATTGGGTAGGATTAAAAATGGTAAGTGACTTGACTATTGCACGGCGGTTTATAATGCAGAAGCTAGCTATTGGAATGCTGAATATTGTTGATCAGCTTCATCCAAATGCACTGACAGAAACTGCCCGTGATGTGAAGGGCTGGAAGGAATTTGCCATGGGTGCTTCCCATTGTACTGGTTATTCAGAATTTGGAAGAATGCTTCTAAAGCTGCACAATAGCATATTGCAGCCCTACATAAATGCTGATTGGTTACAGCATTCTTCTCACTCTTGGGCTGAAAGATGTCATATTGTGAATAGTGCCGAATCTGTAGAGCTGCTGAAGGAGGAATTGTCTGATTCTATTTTATGGAATGATGTCAACTCTCTTTGGAGGACACCAGCACAACCAATGTTAGGTTCTGACTGGAAATCCTGGAAGCATGATGTAATGAAATGGTTTACAACTTCACCTTCCTTATCTTGCAGCAAAGACACACAGCCACAGACTCCTGATGCTTCGTATGCTGCAAACCTTCAGGTTTCTAGAAAACGGGCCAAACTTGAAGTTCGTCGGGCAGATACACATGCTTCGCAAGCGGAAAGGAAAGATGTGAATCAGTCTGTTGCTCTTGAGACTGACCTTGGGTTCTTTAAGAATCAGGATACATTGAGCCTATTAGCAGCTGAGAATGGTAAACATGAAGATGTTAGGGAGGTATCTGCTGCGAACGATTTAACTAGTAATGTGGCCAATAAATGGAATGAAATTGTAGTTGAGGCTGCCGATTCGGATTGCTTGCATACCAAAGGAATGGAATTAACACCTGTGAGTGAAATGGCTGTTTCAAAATCAGTAGAGCCCGGTAGTAAGAATCGACAATGTATAGCCTATATTGAATCAAAGGGAAGACAGTGTGTGAGATGGGCAAATGATGGTGATGTTTATTGTTGTGTACATTTGTCCTCTCGATTTTTGGGCAGCTCGGAAAAAGCTGAGAAGCTGGTTCCAGTTGATACTCCTATGTGTGAAGGTACTACTGTTCTGGGTACAAAATGCAAGCACCGAGCCCTACCTGGTTTTCTATTCTGTAAGAAACACAAGCCTCATGATGAAACAGAGATCTCACATTCACCACAAAGTACACTGAAAAGGAAACTTGAAGAAAATTATGCTGGTTCAGTGAACACCTGCAGAGACATTGTGTTGGTAAATTCTGAAAGCCCACTGGAAGTGGAGCCAGTGTCATTCATTGGCGGTGATTACTTTCAAAGAAAAAGCAGCTTAGGTGAGAATCCCACACATCCTGATAATTATGATGCAACGAAGGGTCTGCACTGTATTGGTTCTCCTCCCTTTGATGACAAGAATCCATGTAGAGAAGCTCCTAAGCGATATTGCTTGTATTGTGAAAATCACCTTCCTAGCTGGCTTAAACGTGCAAGAAATGGGAAGAGTAGAATCGTATCGAAAGAAGTGTTCACAGAACTTTTGAGGGACTGCAACTCGTGGGAGCAAAAGGTGCATTTGCATAAAGCATGTGAGCTTTTCTACAGGCTGTTCAAAAGCATTTTATCACTAAGAAACCCAGTTCCTAAGGATGTTCAATTCCAGTGGGCTTTGACTGAAGCCTCTAAAGATACTAGTGTAGGAGAATTCTTTACAAAGTTGGTTCATACTGAGAAGGCTAGAATCAAACTAATTTGGGGATTTGGTGATGACTTGGATGTATCTCCTATCATGGAAGGACCACCAGTTTTGCCATCCACAACCACTGATAGCATCGATAATGAAAATGCCATTAAGTGTAAGATATGCTCTGCAGAATTCAGTGATGACCAAGCTCTTGGTAACCATTGGATGGATAGTCATAAAAAGGAAGCACAGTGGCTGTTTAGGGGTTATGCATGTGCCATTTGTCTGGATTCTTTTACTAACAAGAAACTGTTGGAAACACATGTTCAAGAGCGACACCATGTGCAGTTTGTTGAACAGTGTATGCTTCTACAATGTATTCCTTGTGGTAGTCATTTCGGAAACAGTGAGCAATTATGGCAGCATGTTTTATCAGTTCATCCTGTTGATTTAAAGCCATCAAAAGCTCCTGAGCGGCAAACTCTCCCTGCTGGAGCTGGTCAAGATTCTCCAGTAAAACATGTCCAAGGAAACTCTGCTCCTTTGGAGAATAATTCTGAGAATCCGGGTGTCTTGCGGAAGTTCACTTGCAGGTTTTGTGGGTTGAAATTTGATTTGTTACCTGACCTTGGTCGACATCACCAAGCTGCCCATATGGGACCCAATCTAGTTAGCAACCACCCTGCGAAGAAGGGGGTTCGCTACTATGCATATAGATTAAAATCTGGCAGACTTAGCCGTCCTAAATTTAAAAAAGGTTTGGCAGCAGCATCATATAGGATCAGAAACAGGGCTAATGCTAATCTAAAGAGAGGTATCCAAGCAACAAAATCGCTTGGCATGAGGGACATGTCCTTACAACCTCGTGTAACGTCATTACAACCTCAGGTGACAACATCCTTACAGCCTCATGTAACTGAAACATCAAAGATTAGTAAATTCGTGGAACATCAGTGCTCGGGAGTTGCAAAAATTTTATTTTCTGAGATTCAGAAGACTAAACCTCGGCCCAACAATCTTGATATTCTATCAATTGCTCGCACTGCTTGCTGCAAAGTTAGTCTTGCAGCCTCACTTGAGGAGAAATTTGGAATTCTACCTAAAAGAATATATTTGAAGGCAGCAAAACTTTGTGCTGAGCATAATATTATTGTAAATTGGCATCATGAGGGGTTTATTTGTCCTAGAGGCTGCAATGGTTTGAAGGATCAAGCATTGCTCTCTCCCTTATCATCTCTTCCTAATAGTTTTGTGAGGCCAAAATCTGTAAATGTATCAGATCATGCAAGTGATGAGTGGGAACTGGATGAATTTCATTGCATCATCAATTCACATGGTCTAAAGTTAGGGTCACTGCAAAAATCTTCAATCTTTTGTGATGATATAAGCTTCGGGAAGGAATCAACTCCTGTGATTTGTGTAGTAGATCAAGAACTTTTGCATTCTATCAATAAGAATGATTCCAATGATCAAGATACTGACTCTGCTATGCCTTGGAAGAGCTTTACCTATGTTACAAAGGCAATGCTTGATCAATCCCTTAGTCTTGATTCGGAGGTAGTTCTCACTGTGAATCTTGAAATTTACTTTCCTAGGCACACATGGTGTTCCTGCTCATATTCTTCATGCTGTCCTGAAACATGTGACCATGTATACCTTTTTGGTAATGACTATGAGGATGCGAACGACATATTTGGGAAACCAATGCGTGGCAGGTTCCCATATGACGAGAATGGTCGAATAATATTAGAGGAAGGTTACCTTGTCTATGAGTGTAACCGAAGGTGCAGATGCAATAAGTCCTGTCCAAACAGAATATTACAGAATGGAGTACGAGTCAAGTTGGAAGTCTTTAAAACAGAGAATAAGGGATGGGGGGTAAGGGCTGGGGAGGCTATCCTACGTGGCACATTTGTATGCGAGTACATTGGAGAGGTTTTAGATGTTCAGGAGGCACATAACAGGCGCAAGAGATATGGCACAGAACATTGCAGTTATTTCTATGACATCGATGATCATGTTAACGATATGAGCAGATTGATAGAAGGACAGGCGCACTATATAATAGATGCTACTAAATATGGAAATGTGTCCAGGTTCATCAATCATAGCTGCTCGCCGAATCTTGTGAATCACCAAGTTTTAGTCGAGAGCATGGATTGCGAGCGTGCGCACATCGGTCTTTATGCAAGTCGGGATATTGCTCTGGGTGAAGAGTTGACACATGACTATCATTATAAGCTTGTGTCTGGAGAAGGAACTCCTTGCCTTTGCGGCGCTTCCAAGTGCAGGGGACGCCTTTATTAG 4425 0.4199 MEGSQIEKQGEEKWTVCHLSTNSDVQCSRDSGCGFQGEDQKEDSALDDLDDDLINEPCLTSENSLLVVDTIESESPNNREGDLSFSEPKWLEGDEPVALWVKWRGKWQTGIRCARADRPLSTLRAKPTHGRKKYFVIFFPHTRNYSWADMLLVRSIDEFPHPIAYKTHWVGLKMVSDLTIARRFIMQKLAIGMLNIVDQLHPNALTETARDVKGWKEFAMGASHCTGYSEFGRMLLKLHNSILQPYINADWLQHSSHSWAERCHIVNSAESVELLKEELSDSILWNDVNSLWRTPAQPMLGSDWKSWKHDVMKWFTTSPSLSCSKDTQPQTPDASYAANLQVSRKRAKLEVRRADTHASQAERKDVNQSVALETDLGFFKNQDTLSLLAAENGKHEDVREVSAANDLTSNVANKWNEIVVEAADSDCLHTKGMELTPVSEMAVSKSVEPGSKNRQCIAYIESKGRQCVRWANDGDVYCCVHLSSRFLGSSEKAEKLVPVDTPMCEGTTVLGTKCKHRALPGFLFCKKHKPHDETEISHSPQSTLKRKLEENYAGSVNTCRDIVLVNSESPLEVEPVSFIGGDYFQRKSSLGENPTHPDNYDATKGLHCIGSPPFDDKNPCREAPKRYCLYCENHLPSWLKRARNGKSRIVSKEVFTELLRDCNSWEQKVHLHKACELFYRLFKSILSLRNPVPKDVQFQWALTEASKDTSVGEFFTKLVHTEKARIKLIWGFGDDLDVSPIMEGPPVLPSTTTDSIDNENAIKCKICSAEFSDDQALGNHWMDSHKKEAQWLFRGYACAICLDSFTNKKLLETHVQERHHVQFVEQCMLLQCIPCGSHFGNSEQLWQHVLSVHPVDLKPSKAPERQTLPAGAGQDSPVKHVQGNSAPLENNSENPGVLRKFTCRFCGLKFDLLPDLGRHHQAAHMGPNLVSNHPAKKGVRYYAYRLKSGRLSRPKFKKGLAAASYRIRNRANANLKRGIQATKSLGMRDMSLQPRVTSLQPQVTTSLQPHVTETSKISKFVEHQCSGVAKILFSEIQKTKPRPNNLDILSIARTACCKVSLAASLEEKFGILPKRIYLKAAKLCAEHNIIVNWHHEGFICPRGCNGLKDQALLSPLSSLPNSFVRPKSVNVSDHASDEWELDEFHCIINSHGLKLGSLQKSSIFCDDISFGKESTPVICVVDQELLHSINKNDSNDQDTDSAMPWKSFTYVTKAMLDQSLSLDSEVVLTVNLEIYFPRHTWCSCSYSSCCPETCDHVYLFGNDYEDANDIFGKPMRGRFPYDENGRIILEEGYLVYECNRRCRCNKSCPNRILQNGVRVKLEVFKTENKGWGVRAGEAILRGTFVCEYIGEVLDVQEAHNRRKRYGTEHCSYFYDIDDHVNDMSRLIEGQAHYIIDATKYGNVSRFINHSCSPNLVNHQVLVESMDCERAHIGLYASRDIALGEELTHDYHYKLVSGEGTPCLCGASKCRGRLY 1474
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Arst5g00922 1474 SMART PostSET_3 1458 1474 IPR003616 -
Arst5g00922 1474 Pfam SET domain 1330 1450 IPR001214 GO:0005515
Arst5g00922 1474 SMART set_7 1319 1457 IPR001214 GO:0005515
Arst5g00922 1474 SUPERFAMILY SET domain 1159 1464 IPR046341 -
Arst5g00922 1474 ProSiteProfiles Pre-SET domain profile. 1240 1316 IPR007728 GO:0005634|GO:0008270|GO:0034968|GO:0042054
Arst5g00922 1474 MobiDBLite consensus disorder prediction 24 44 - -
Arst5g00922 1474 MobiDBLite consensus disorder prediction 878 893 - -
Arst5g00922 1474 Pfam Zinc finger C2H2-type, 3 repeats 798 925 IPR040689 -
Arst5g00922 1474 ProSitePatterns Zinc finger C2H2 type domain signature. 764 785 IPR013087 -
Arst5g00922 1474 ProSitePatterns Zinc finger C2H2 type domain signature. 832 853 IPR013087 -
Arst5g00922 1474 Coils Coil 344 364 - -
Arst5g00922 1474 SMART c2h2final6 830 853 IPR013087 -
Arst5g00922 1474 SMART c2h2final6 762 785 IPR013087 -
Arst5g00922 1474 SMART c2h2final6 901 924 IPR013087 -
Arst5g00922 1474 SMART c2h2final6 796 819 IPR013087 -
Arst5g00922 1474 ProSiteProfiles Post-SET domain profile. 1458 1474 IPR003616 -
Arst5g00922 1474 SMART preset_2 1165 1303 IPR007728 GO:0005634|GO:0008270|GO:0034968|GO:0042054
Arst5g00922 1474 ProSiteProfiles Zinc finger C2H2 type domain profile. 796 819 IPR013087 -
Arst5g00922 1474 MobiDBLite consensus disorder prediction 860 893 - -
Arst5g00922 1474 Gene3D SET domain 1157 1474 IPR046341 -
Arst5g00922 1474 ProSiteProfiles Zinc finger C2H2 type domain profile. 762 790 IPR013087 -
Arst5g00922 1474 ProSitePatterns Zinc finger C2H2 type domain signature. 903 924 IPR013087 -
Arst5g00922 1474 PANTHER HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5 2 1474 - -
Arst5g00922 1474 PANTHER HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5 2 1474 - -
Arst5g00922 1474 Gene3D Classic Zinc Finger 749 822 - -
Arst5g00922 1474 ProSiteProfiles Zinc finger C2H2 type domain profile. 901 929 IPR013087 -
Arst5g00922 1474 ProSiteProfiles SET domain profile. 1319 1451 IPR001214 GO:0005515
Arst5g00922 1474 Pfam Pre-SET motif 1169 1311 IPR007728 GO:0005634|GO:0008270|GO:0034968|GO:0042054
       

Duplication type information


Select Gene Chromosome Start End Duplicated_type
Arst5g00922 Arst-Chr5 8692702 8702942 Dispersed/Tandem
       

Functional genes information


Select Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Arst5g00922 70 1473 C2H2 Transcription Factor Family AT2G23740 50.669 0.0 1362
       

Transcription factors information


Select Regulatory Factors Family Gene Hmm_acc Hmm_name E_value Clan
TF C2H2 Arst5g00922 Pre-SET 8.7e-15 No_clan
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst5g00922 - - adu:107487826 3042.68