Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Arst5g02071 ATGAAGAGGTCTAGGGACGATATTTTCGCGAGTCCTCAACTTAAACGGCCTATAGTGTCTTCTCGAGGAGAAGCGTCCGGGCAACCCCAGATGATGAATGGAGGTGCTGTTATTCAGAAGCTAACTACAAATGATGCATTGGCATATCTCAAGGCAGTGAAGGATATATTTCAGGATAAGAGGGACAAGTATGATGACTTTTTGGAAGTCATGAAAGATTTTAAGGCTCAAAGAGTTGATACTGCAGGCGTGATAGCAAGGGTTAAACAGCTCTTTAAAGGGCACAGAGACTTGCTTTTGGGATTCAACACCTTCTTGCCAAGGGGTTATGAAATCACACTTCCACTGGAGGATGAACAACCTGCCCCAAAGAAGCCTGTTGAATTTGAAGAAGCGATTAATTTTGTGAACAAAATTAAGACCCGGTTTCAAGGTGATGATCGTGTTTACAAGTCATTTCTGGACATATTAAATATGTACAGAAAGGAAAGCAAGTCTATAACAGATGTCTACCAGGAGGTTGCTGCACTTTTCCAAGAGCATCCGGATCTTCTTGATGAGTTTACTCATTTTCTCCCGGACACTTCCTCAGCAGCCTCTGCTCATTATATTTCTGCTAGAAATTCTATGCTTCGTGATAGGAGCTCTGCAATGCCAACTGTACGGCAAATGCATGTTGAGAAGAGAGAAAGGACTATGACCTCACATGGTGATCGTGACCTCAGTGTTGACCGTCCTGACCCTGATAATGACAGAGGTTTGATGAGGGCAGAAAAGGATCAGAGGAGACGTCTAGAGAAGGATAAGGAATGCAGAGAGGAGAGAGATAGGAGAGAACGTGACAGAGATGATAGAGATTATGAGCATGATGGTGGTCGAGATAGGGAAAGGTTCTCCCATAAACGGAAATCTGATCGTAGGGCTGAAGACTCTGGAGCCGAACCATTGCTCGATGATGAACATATTGGTATGCGCCCTATGCCATCCACTTGTGATGAAAGAAATACTCTGAAAAGCATGTATAGCCAGGAGCTTGCTTTCTGTGAAAAGGTCAAAGAGAAATTACGAAATCCTGATGACTACCAGGAATTTTTGAAGTGTTTGCATATTTACAGCAGGGAAATAATTACCAGACATGAATTGCAGTCATTGGTTGGTGATTTACTGGGGAAATATCCAGATCTTATGGAGGGGTTTAATGAATTTTTGATACAATCTGAAAAGAATGATGGTGGATTCCTTGCTGGTGTCATGAATAAAAAGTCCTTATGGAGTGACGGACAAAGGCCGAAATCTGTGAAGGTTGAAGACAGAGATCGTGATCGAGACCGTTGTAGGGATGATGGCATGAAAGAAAGGGATCGTGAATTCCGAGAAAGGGACAAATCCACTGCCCCCGCCAACAAGGATGTTTCAGGTCCTAAGATGTCCATATATCCCAGCAAGGATAAGTATTTGTCAAAGCCTATAAATGAGCTGGACCTTTCTAACTGTGATCAATGCACTCCGAGCTATCGTCTATTGCCAAAAAATTACCCAATACCTGTAGCTAGCCAGAGAACAGAACTTGGTGCAGAAGTATTAAATGATCACTGGGTGTCTGTTACTTCAGGAAGTGAGGATTATTCCTTTAAACATATGCGCAAAAATCAGTATGAAGAGAGCTTGTTCAGATGTGAAGATGACAGGTTTGAACTTGATATGTTGCTAGAGTCTGTAAATGCAACTACTAAGCGAGTGGAAGAGCTATTAGAAAAGGTCAATAATAATATAATCAAAGGAGACAGTCCAATTCGTATTGAGGAGCACTTAACAGCCTTAAATCTTAGGTGCATTGAACGATTATATGGTGACCATGGGCTAGATGTGATGGATGTGTTACGGAAAAATGCATCTCTGGCTTTGCCAGTGATATTAACCCGCTTGAAGCAGAAACAAGATGAATGGGCAAGGTGTCGTGCTGATTTTAATAAAGTCTGGGCTGAAATATATGCCAAAAATTATCATAAATCTCTTGATCACCGTAGCTTCTACTTTAAACAACAGGATACAAAAAGCTTGAGTACTAAAGCATTACTGGCCGAAATCAAAGAAATCAGTGAGAAGAAACGCAAAGAAGATGATGTTCTTCTTGCGATTGCTGCTGGAAATAGACGTCCTGTTCTTCCAAACCTTGAGTTTGAGTACACTGATTCTGATATTCATGAAGATCTGTATCAGCTTGTAAAATATTCTTGTGGAGAAATGTGTACAACTGAACAATTGGATAAAGTTATGAAGGTTTGGACAACATTTTTAGAACCCATTCTGTGTGTTCCTTCTCGGCCTCTGGGTGCCGAGGATACAGAAGATGTTGTCAAGGATAAGAATAATTCTGCCAAAAGTGGCACTGCAAGTGTTGCCGAGAGTGAGGGTAGTGCTGGTGCTGGTGCTATTGTAGTGAATCCTAAGCATATTAACACTTCTAGAAATGGGGATGAGTGTATGCCATTAGATCAATCAAATTCTAGCAAAGTATGGCAATCAAATGGTGACAGTGGTGCAAAAGATGATAAATGTCTTGATTCAGACCGCACTCTGCATAAAACTGAAACTTCAGGCACTAATACACAGCATGTTAAAATTAATACTAGTTCATTCACACCCGATGAAATGTCAGGAGTCAATAAGCAAGACCACTCCAGTGAGCGGTTGGTGAATGCTAATGTTTCACCAGCATTAGGAGTGGAGCTAAGTAATGGAAGAACAAGCATGGATAATGCATCAGGAATCATTGCCACTAATCCGTCTAGACCTGGTAATATTTCTGGCGAAGGTGGAGTTGATTTACCTTCATCAGAGGGTGGTGATTCTACCAGACCAGGTACATCCACAAATGGGACCATCACAGAAGGGACCGAAGTTCACAGGTACCCAGAAGAATCAGTTCGGCAGTTAAAAAGTGAAAGAGAAGAGGGCGAGTTGTCCCCAAATGGAGACTTTGAAGAGGATAACTTTGCTGTTTATGGAGATACTGGTTTGGATGCAGTCCATAAGGGGAAGGATGGCGGTTCGAGTCAGCAATACCAAAACAGAAATGGAGAACAAGCTTTAGGTGAAGTTAGAGGAGAGAATGATGTTGATGCTGACGATGAAGGTGAGGAAAGTCCACACAGGTCATCTGAGGACAGTGAGAATGCTTCCGAGAATGTTGATGTTTCTGGAAGCGAGTCTGCTGATGGTGAGGAATCCCGAGAAGAGCACGAGGATGGGGAAAATGATAACAAAGCTGAGAGTGAAGGTGAAGCTGAAGGAATGGCTGATGCCCATGATGTTGAAGGAGATGGAACCTCCTTGCCATTTTCAGAGCGCTTCCTATTAACTGTAAAGCCACTGGCGAAGCATGTTCCCCCAGCGTTACATGAGAAAGAAAGGACTTCTCGAATTTTTTATGGAAATGATTCCTTTTACGTCCTGTTTAGACTTCATCAGACATTGTATGAGAGGATACAATCAGCAAAGATTAACTCATCATCTGCTGAAAGGAAATGGGGGGCTTCAAATAATACGGGTTCTACTGATCAATACAACAGGTTCATGAATGCGCTCTACAATTTGCTGGATGGTTCATCTGATAATACAAAATTCGAGGATGATTGTCGAGCCATTATTGGAACTCAGTCATATCTCTTATTCACTTTAGACAAGCTGATTTATAAGCTTGTTAAACAGCTTCAAAATGTTGCCACTGATGAGATGGATAACAAGCTTCTCCAATTATATGCCTATGAGAAATCAAGAAAACCAGGAAGATTTGTTGACGCAGTTTATCATGAAAATGCCCGTGTTCTTCTTCATGAAGAGAACATATACCGAATTGAATATTCACCTGGACCTAAGAAATTGTCTCTTCAACTGATGGACTATGGACATGATAAGCCTGAAGTGACTGCCGTGTCAATGGACCCCAACTTTTCAGGCTATTTGTACAACGAATTTTTTTCTGTTGTCTCTGACAAAAAGGAAAAGTCTGGAATTTTCTTGAAGAGGAACAAACGTAGATATGCCTGTGGTGATGACATTTCAAGCGAGGCTGTGGAAGGACTACAAGTTATTAATGGTCTTGAGTGTAAGATATCCTGCAGTTCATCCAAGGTGTCATATGTTTTAGATACGGAAGATTTTTTGTTCCGGAAGAGAAAGAATAGGGCAAAGTCTTTAACCATTAGTTCGAGAAGAGCACAACGGTTCCACAAATTGTTTTCCCTTGCATGCTGA 4257 0.4156 MKRSRDDIFASPQLKRPIVSSRGEASGQPQMMNGGAVIQKLTTNDALAYLKAVKDIFQDKRDKYDDFLEVMKDFKAQRVDTAGVIARVKQLFKGHRDLLLGFNTFLPRGYEITLPLEDEQPAPKKPVEFEEAINFVNKIKTRFQGDDRVYKSFLDILNMYRKESKSITDVYQEVAALFQEHPDLLDEFTHFLPDTSSAASAHYISARNSMLRDRSSAMPTVRQMHVEKRERTMTSHGDRDLSVDRPDPDNDRGLMRAEKDQRRRLEKDKECREERDRRERDRDDRDYEHDGGRDRERFSHKRKSDRRAEDSGAEPLLDDEHIGMRPMPSTCDERNTLKSMYSQELAFCEKVKEKLRNPDDYQEFLKCLHIYSREIITRHELQSLVGDLLGKYPDLMEGFNEFLIQSEKNDGGFLAGVMNKKSLWSDGQRPKSVKVEDRDRDRDRCRDDGMKERDREFRERDKSTAPANKDVSGPKMSIYPSKDKYLSKPINELDLSNCDQCTPSYRLLPKNYPIPVASQRTELGAEVLNDHWVSVTSGSEDYSFKHMRKNQYEESLFRCEDDRFELDMLLESVNATTKRVEELLEKVNNNIIKGDSPIRIEEHLTALNLRCIERLYGDHGLDVMDVLRKNASLALPVILTRLKQKQDEWARCRADFNKVWAEIYAKNYHKSLDHRSFYFKQQDTKSLSTKALLAEIKEISEKKRKEDDVLLAIAAGNRRPVLPNLEFEYTDSDIHEDLYQLVKYSCGEMCTTEQLDKVMKVWTTFLEPILCVPSRPLGAEDTEDVVKDKNNSAKSGTASVAESEGSAGAGAIVVNPKHINTSRNGDECMPLDQSNSSKVWQSNGDSGAKDDKCLDSDRTLHKTETSGTNTQHVKINTSSFTPDEMSGVNKQDHSSERLVNANVSPALGVELSNGRTSMDNASGIIATNPSRPGNISGEGGVDLPSSEGGDSTRPGTSTNGTITEGTEVHRYPEESVRQLKSEREEGELSPNGDFEEDNFAVYGDTGLDAVHKGKDGGSSQQYQNRNGEQALGEVRGENDVDADDEGEESPHRSSEDSENASENVDVSGSESADGEESREEHEDGENDNKAESEGEAEGMADAHDVEGDGTSLPFSERFLLTVKPLAKHVPPALHEKERTSRIFYGNDSFYVLFRLHQTLYERIQSAKINSSSAERKWGASNNTGSTDQYNRFMNALYNLLDGSSDNTKFEDDCRAIIGTQSYLLFTLDKLIYKLVKQLQNVATDEMDNKLLQLYAYEKSRKPGRFVDAVYHENARVLLHEENIYRIEYSPGPKKLSLQLMDYGHDKPEVTAVSMDPNFSGYLYNEFFSVVSDKKEKSGIFLKRNKRRYACGDDISSEAVEGLQVINGLECKISCSSSKVSYVLDTEDFLFRKRKNRAKSLTISSRRAQRFHKLFSLAC 1418
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Arst5g02071 1418 Pfam Sin3 family co-repressor 500 590 IPR013194 -
Arst5g02071 1418 SUPERFAMILY PAH2 domain 337 403 IPR036600 GO:0006355
Arst5g02071 1418 Coils Coil 255 275 - -
Arst5g02071 1418 Pfam C-terminal domain of Sin3a protein 1143 1389 IPR031693 -
Arst5g02071 1418 SUPERFAMILY PAH2 domain 123 194 IPR036600 GO:0006355
Arst5g02071 1418 MobiDBLite consensus disorder prediction 1045 1059 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 425 478 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 223 327 - -
Arst5g02071 1418 ProSiteProfiles PAH domain profile. 39 109 IPR003822 GO:0006355
Arst5g02071 1418 Pfam Paired amphipathic helix repeat 62 106 IPR003822 GO:0006355
Arst5g02071 1418 Pfam Paired amphipathic helix repeat 359 403 IPR003822 GO:0006355
Arst5g02071 1418 Pfam Paired amphipathic helix repeat 149 192 IPR003822 GO:0006355
Arst5g02071 1418 FunFam Paired amphipathic helix protein SIN3 335 408 - -
Arst5g02071 1418 Gene3D Paired amphipathic helix 335 407 IPR036600 GO:0006355
Arst5g02071 1418 ProSiteProfiles PAH domain profile. 125 195 IPR003822 GO:0006355
Arst5g02071 1418 Gene3D Paired amphipathic helix 124 196 IPR036600 GO:0006355
Arst5g02071 1418 ProSiteProfiles PAH domain profile. 337 406 IPR003822 GO:0006355
Arst5g02071 1418 Coils Coil 566 590 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 211 331 - -
Arst5g02071 1418 SUPERFAMILY PAH2 domain 39 108 IPR036600 GO:0006355
Arst5g02071 1418 PANTHER SIN3B-RELATED 40 1287 IPR039774 GO:0003714
Arst5g02071 1418 MobiDBLite consensus disorder prediction 949 965 - -
Arst5g02071 1418 FunFam Paired amphipathic helix SIN3-like protein 124 195 - -
Arst5g02071 1418 Gene3D Paired amphipathic helix 45 107 IPR036600 GO:0006355
Arst5g02071 1418 MobiDBLite consensus disorder prediction 782 854 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 1082 1104 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 921 1110 - -
Arst5g02071 1418 FunFam Paired amphipathic helix protein Sin3 45 107 - -
Arst5g02071 1418 SMART hdac_interact2seq4b 497 597 IPR013194 -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 967 988 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 430 466 - -
Arst5g02071 1418 MobiDBLite consensus disorder prediction 817 844 - -
       

Duplication type information


Select Gene Chromosome Start End Duplicated_type
Arst5g02071 Arst-Chr5 30120605 30132600 Dispersed/Tandem
       

Functional genes information


Select Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Arst5g02071 28 144 WRKY Transcription Factor Family AT4G12020 50.427 5.91e-25 111
       

Transcription factors information


Select Regulatory Factors Family Gene Hmm_acc Hmm_name E_value Clan
TR Others Arst5g02071 Sin3a_C 3.2e-57 No_clan
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst5g02071 K11644 - gmx:100811928 2258.03