Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Arst3g01354 ATGGCCCAACAGTCATCTAGGCTTCAGCGGTTGCTTACTCTGTTGGATACTGGGTCAACCCAAGCGACAAGGTTGACTGCTGCTCGTCAGATAGGGGAAATTGCAAAATCACACCCTCAAGACTTGAACAGTCTTCTTAAGAAGGTTTCTCAATATCTGCGCAGCAAAAATTGGGAGACAAGAGTCGCTGCTGCTCATGCAATTGGGTCTATTGCTGAGAATGTTAAACACACAAGTTTGAATGAACTTACTACATCTGTTGTATCAAAAATATCTGAGAATGGGATATCATGTAGTGTTGAGGATCTTTGTGCATGGACATATTTGCAATCAAGAATTACAGGAAGCTCATTTAGAAGTTTTGACATGAGCAAGGTGCTTGAATTTGGAGCTTTATTAGCATCTGGAGGACAGGAATATGATATTGTAAGTGATAATATAAAGAACCCAAAGGAGCGACTGGTACGCCAAAAGCAAAATCTTCGACGCCGTTTAGGTTTGGATGTTTGTGAACAATTTATGGATATGAGTGATGTAATAAGAGATGAAGATCTTATGACACACAAGTCAGATTCACATCCCAATGGAATAGATCATGGAGTTTTTACTTCTTCTTCTGTGCATAACATCCAGAAAATGGTTGCAAACATGGTTCCTAGTGTCAAATCAAAGTGGCCAAGTGCAAGAGAACTGAATCTTCTAAAGCGCAAAGCAAAAATAAACTCAAAGGACCAGACAAAAAGCTGGGTCGAAGATGGTGTTACCGATCCATCAGGTGCTCAGAATTCGGCTTCGAAAGCCACGTATCCTGAGTCAGTTAATTACAATAAGGTATTCATGGATGTTAATCATGATGATGATGGCCTTGACCATGATGGGGATGGACAATGGCCTTTCCACACTTTTGTTGAGCAACTTATTATTGATATGTTTGATCCAGTTTGGGAGATCCGACATGGTAGTGTGATGGCACTGAGAGAAATTTTAACACATCAAGGTGCTTCTGCTGGGGTATTGAAACATGACTTACGCTTGGGTGGGAACTTTATGGTCGAATTAGAAGACAAAAGTATAACTATTGAAAAAAAAGAAGACAGTAGTATGTCAAATATATTGAAGAGAGAGAGGGAGATTGATTTGAATATGCAAGTTTCTGCAGATGAGTTTGACTCAAACTTGAAGAGACCAAAACTTGAAAATGTAACATCATCAACCTCGATGGATAGTCTAATTACTTGCAGCAATGAAGGTGATACTAAAATCAGCATTAGTTCTGAAACCTCTGGGTACAATTTACCTTTGGATTGTGTAAATGGGCAATTTAATTGTAATTCTGTTGAGATGGATGTGGTATCTTTCTCTGATGGACTGCAAGATGCATGCAAGGAACCTACTATTGTAGCAGAGCAGAAGGGTTATTCTGAAGAGATTCCCTCCAGAAATCTTAATGTGCTGAGAAATGTTCCCCAGAATTCCGAGCTGATGAGCATGGTTAAAGTTGCCAGAAGTTCATGGTTGCGAAACTGTGGATTTCTTCAAGATTGTGTGATACGCTTTTTGTGTGTGTTGTCACTAGACCGCTTTGGAGATTATGTATCCGATCAGGTTGTTGCTCCAGTGCGTGAAACCTGTGCACAGGCATTAGGTGCGGCATTTAAGTACATGCATCCGGCACTGGTAAATGAAACATTGAATATATTGTTGAAAATGCAGTGTAGACCAGAATGGGAGATTCGTCATGGAAGTCTTCTAGGTATCAAATATTTGGTTGCTGTGCGGCAGGAGTTGCTACCTGACTTGCTTGGACTTGTTCTTCCTGCATGTAAATCTGGGTTGGAGGACCCGGATGATGATGTCCGAGCTGTTGCTGCTGATGCCTTAATACCAGCTTCAGCTGCAATCGTTGCATTGCAGGGTCCAACATTGCGCACAATTGTGATGCTGCTGTGGGATATATTGCTTGATTTGGATGATTTAAGTCCATCCAC
TAGCAGTGTTATGAATTTGCTGTCAGAAATTTATTCTCAAGAAGAGATGGTCCCAATGATGTATGAAGTTCTTAGATTGGGAGACAACGGAATATCTATTCAAAATGGAGTCGGGGGTGGTGATGATGATGATGATGATGAAGAAAATCCTTATGTGCTTTCAACGTTGGCACCACGTTTGTGGCCTTTTATGAGGCATAGTATCAGCTCTGTCCGGTATTCAGCAATGCGGACTCTGGAGAGGCTACTTGAAGCTGGATATAAGAGAAGAATGTCCGAGTTGTCTAATGCTTCGTTTTGGCCCTCTATCATATTTGGAGATACCCTTAGAATTGTATTTCAGAATCTGCTATTGGAAACAAATGAGGATATTTTGCAATGTTCAGAGAGAGTGTGGAGTCTCCTTGTTAAGTGTTCTGTGGAGGACTTGGATACTGCTGCGAGATCTTACATGGCTTCTTGGATAGAACTTGCTTCTACACCATTTGGTTCAGCATTAGATGCCTCAAAGATGTTTTGGCCTGCTGCATTTCCAAGGAAAAGTCAATTTAGAGCTGCTGCTAAAATGAGAGCTGTGAAGTCTGAATATGATTATGGTGGGGATTTTGGTCTTGATTCTACAAAAGGATCTGTTCCACAAGAAAGAAATGGTGATGCTGCTATGGATTCAGTGAAGATAGTTGTTGGTGCTGATGTGGACACATCTGTTACTCATACACGGGTGGTTACAGCAACTGCATTGGGAATTTTTGCCTCTAAGCTGCCAGAGGATTCTTTGAAGTATGTTGTTGATCCACTCTGGAGGTCTCTAACCTCTTTGTCTGGTGTTCAACGTCAGGTTGCATCTATGGTACTCATTTCTTGGTTCAAGGAGATTAAGAGGACAAATTCATCAGATTCACTTCTTCCATATGCTGAACTTTCAAGAACTTATTCAAAGATGCGTAATGAGGCTGGTCAATTGCTAAATGCTATCAAGTCTTCTGGTATGTTTGATGAGTTATTGTCAACTACCAAAATTGAGTTGGATAGCTTGAGTGTGGATGGTGCCATTAGTTTTGCATCAAAAGTTCCAGCAGTGTGTAATGATAGTTCTCTGAATGAGTCCTTGATAAAGAATACTTTAGATGATATAGAATCCACAAAACAGAGGCTCTTGACAACTTCTGGCTATTTGAAATGTGTGCAGAGTAATTTGCATGTTACAGTCTCATCTGCAGTTGCAGCTGCTGTTGTTTGGATGTCTGAATATTCTTCTCGGCTTACCCCAATCATTTTGCCTTTGATGGCTTCAATCAGACGAGAGCAGGAGGAAATACTGCAAATGAAGTCAGCTGAAGCACTTGCTGAGCTAATATATCATTGTGTTGCTCGTAGGCCATGCCCAAATGATAAGTTAATTAAAAATATATGCAGTATGACATGCTTGGATCCTTCCGAGACCCCTCGAGCTAAACTCATTTGCTCCATGGAGAGTATTGATGACCAAGGTCTTCTGTCGTTTGGAACTCCTGTGAGCAAACATAAATCAAAGGTCCATGTTTTGGCTGGTGAAGATCGATCAAAAGTGGAAGGGTTCATAAGTAGACGTGGGTCGGAATTAGCGTTGAGGCTTTTGTGCGAGAAGTTTGGTGCTTTGTTATTTGATAAGCTTCCTAAGCTGTGGGATTGCCTTACTGAGGTGCTTAAACCTAGCTCCTCTGAATCTCCAGCAGTCACTAATGAAAAACAAGCTACTATGGCTGTTGAGTCTATTAGTGATCCCCAGATATTGATTAACAATATTCAGGTGGTACGATCTATTGCGCCCTTGTTAAATGAGGAGTTAAAGCCAAAATTATTGACACTTCTCCCGTGCATTTTTAAATGCATTCAACATTCTCATGTTGCAGTTAGATTAGCTGCTTCACGCTGCATCACTTCCATGGCCCGGTCAATGACTGTGAAAGTTATGAGTGCTGTGGTTGAAAATGCTATCCCAATGTTGGAAGATGCATCAT
CAGTTCATGCTCGTCAAGGAGCAGGCATGTTGATTAATTTTCTTGTTCAGGGCCTGGGGGTAGAGTTGGTCCCTTACGCTCCATTGTTGGTAGTTCCACTTCTAAGGTGCATGAGTGACTGTGATCAGTCTGTCAGGCAGAGTGTGACCCATAGTTTTGCTGCTCTTGTGCCTTTACTTCCTTTGGCACGAGGCCTTCCTCAACCAGTTGGATTGGGGGAAGGTATATCTAGAAATGCTGATGATTTGCAATTTTTGGAGCAACTGCTTGACAACTCCCACATTGAAGATTACAAGTTATGCACTGAATTGAAAGTCACATTGAGAAGGTACCAACAAGAAGGCATAAATTGGTTAGCTTTTTTGAAACGTTTCAAGCTTCATGGAATTTTATGTGATGACATGGGGCTTGGTAAGACGCTTCAGTCATCGGCTATTGTGGCCTCTGATATTGCTGAGCATCGAACTCAGTCTGGGAATGGAGATCTTCTACCATCTTTGATTATTTGCCCATCAACCCTAGTTGGACACTGGGCCTTTGAGATAGAGAAATATATTGATGTTTCAGTTATATCTTGTCTTCAGTACGTTGGTTCTGCTCAGGACCGAATGATTCTTCGAGATCATTTTTGCAAGCATAACGTCATTATAACATCATATGATGTTGTCCGTAAAGACATCGATTATTTGGGACAGCTTCTGTGGAATTACTGCATTTTAGATGAAGGGCATATAATCAAGAATGCTAAGTCCAAAGTTACACTTGCTGTAAAGCAGTTAAAAGCCCAACACCGCTTGATATTGAGTGGGACTCCAATACAGAATAACATCATGGACTTGTGGTCCCTTTTTGATTTTCTAATGCCTGGATTTCTTGGAACAGAGAGACAGTTCCAAGCAACATATGGAAAACCGCTTTTAGCTGCTAGAGATCCAAAATGCTCAGCCAAAGATGCCGAAGCAGGAGCACTTGCTATGGAGAGATTGCATAAGCAGGCTATGCCTTTTCTCCTTCGTAGAACAAAGGATGAGGTCTTATCTGATCTTCCGGAGAAAATAATTCAAGACAGATACTGTGATTTGAGCCCTGTACAATTTAAACTCTATGAACAGTTTTCTGGTTCTCACGTAAAACAAGAAATGTCATCTATTGTGACAACGAATGAGTCAGCTGGAGCAGAAGGAAGCAGCAGTTCAACTAAAGCATCTTCACATGTTTTCCAGGCACTTCAATACTTGCTGAAGCTGTGTAGTCATCCATTGCTTGTCATCAATGATAAGGTTTTAGATTCACTTTCGACCATCCTCTCCGGAGTGTTACCGGGTGTTTCTGACATCATTTCAGAACTTCACAACCTCCATCATTCGCCAAAATTAGTTGCTCTTCAGGAGATTCTTGAAGAGTGTGGAATTGGAGTTGATGCTTCTGGTTCTGAGGGTTCTGTTAACGTTGGGCAGCACAGGGTATTGATATTTGCTCAACACAAGGCTTTTCTGGATATAATAGAAAGAGATTTGTTTCAGACACACATGAAAAATGTTACATACTTGCGGCTAGATGGATCAGTTGAGCCAGAAAAGCGTTTTGACATTGTCAAAGCATTCAATTCAGATCCTACCATTGATGTCTTACTGCTTACAACACACGTGGGTGGGCTTGGTTTGAACCTGACATCTGCAGATACCCTTGTTTTTGTGGAGCATGACTGGAATCCAATGCGAGATCATCAGGCCATGGATAGAGCGCACAGGTTAGGTCAGAAAAAAGTTGTTAATGTCCACCGTCTAATAATGCGTGGTACTTTGGAAGAGAAGGTTATGAGTCTGCAAAGGTTTAAAGTGTCTGTGGCTAATGCTGTGATTAATGCGGAGAATGCTAGCATGAAGACTATGAATACGGATCAGTTGCTTGATTTGTTTGCTTCAGCAGATACCTCCAAGAAGGGTTCTTCTGCCGGGAAGAGTTTGGAAAACAACCCTGAAGGAGACGCTAAACTAGTA
GGTACCGGAAAGGGGCTGAAAGCCATACTTGGGGGATTGGAAGAGCTATGGGACCAATCACAATACACAGAAGAGTACAATCTAAATCAATTCTTAGCAAAACTAAATGGTTAA 6102 0.4136 MAQQSSRLQRLLTLLDTGSTQATRLTAARQIGEIAKSHPQDLNSLLKKVSQYLRSKNWETRVAAAHAIGSIAENVKHTSLNELTTSVVSKISENGISCSVEDLCAWTYLQSRITGSSFRSFDMSKVLEFGALLASGGQEYDIVSDNIKNPKERLVRQKQNLRRRLGLDVCEQFMDMSDVIRDEDLMTHKSDSHPNGIDHGVFTSSSVHNIQKMVANMVPSVKSKWPSARELNLLKRKAKINSKDQTKSWVEDGVTDPSGAQNSASKATYPESVNYNKVFMDVNHDDDGLDHDGDGQWPFHTFVEQLIIDMFDPVWEIRHGSVMALREILTHQGASAGVLKHDLRLGGNFMVELEDKSITIEKKEDSSMSNILKREREIDLNMQVSADEFDSNLKRPKLENVTSSTSMDSLITCSNEGDTKISISSETSGYNLPLDCVNGQFNCNSVEMDVVSFSDGLQDACKEPTIVAEQKGYSEEIPSRNLNVLRNVPQNSELMSMVKVARSSWLRNCGFLQDCVIRFLCVLSLDRFGDYVSDQVVAPVRETCAQALGAAFKYMHPALVNETLNILLKMQCRPEWEIRHGSLLGIKYLVAVRQELLPDLLGLVLPACKSGLEDPDDDVRAVAADALIPASAAIVALQGPTLRTIVMLLWDILLDLDDLSPSTSSVMNLLSEIYSQEEMVPMMYEVLRLGDNGISIQNGVGGGDDDDDDEENPYVLSTLAPRLWPFMRHSISSVRYSAMRTLERLLEAGYKRRMSELSNASFWPSIIFGDTLRIVFQNLLLETNEDILQCSERVWSLLVKCSVEDLDTAARSYMASWIELASTPFGSALDASKMFWPAAFPRKSQFRAAAKMRAVKSEYDYGGDFGLDSTKGSVPQERNGDAAMDSVKIVVGADVDTSVTHTRVVTATALGIFASKLPEDSLKYVVDPLWRSLTSLSGVQRQVASMVLISWFKEIKRTNSSXDSLLPYAELSRTYSKMRNEAGQLLNAIKSSGMFDELLSTTKIELDSLSVDGAISFASKVPAVCNDSSLNESLIKNTLDDIESTKQRLLTTSGYLKCVQSNLHVTVSSAVAAAVVWMSEYSSRLTPIILPLMASIRREQEEILQMKSAEALAELIYHCVARRPCPNDKLIKNICSMTCLDPSETPRAKLICSMESIDDQGLLSFGTPVSKHKSKVHVLAGEDRSKVEGFISRRGSELALRLLCEKFGALLFDKLPKLWDCLTEVLKPSSSESPAVTNEKQATMAVESISDPQILINNIQVVRSIAPLLNEELKPKLLTLLPCIFKCIQHSHVAVRLAASRCITSMARSMTVKVMSAVVENAIPMLEDASSVHARQGAGMLINFLVQGLGVELVPYAPLLVVPLLRCMSDCDQSVRQSVTHSFAALVPLLPLARGLPQPVGLGEGISRNADDLQFLEQLLDNSHIEDYKLCTELKVTLRRYQQEGINWLAFLKRFKLHGILCDDMGLGKTLQSSAIVASDIAEHRTQSGNGDLLPSLIICPSTLVGHWAFEIEKYIDVSVISCLQYVGSAQDRMILRDHFCKHNVIITSYDVVRKDIDYLGQLLWNYCILDEGHIIKNAKSKVTLAVKQLKAQHRLILSGTPIQNNIMDLWSLFDFLMPGFLGTERQFQATYGKPLLAARDPKCSAKDAEAGALAMERLHKQAMPFLLRRTKDEVLSDLPEKIIQDRYCDLSPVQFKLYEQFSGSHVKQEMSSIVTTNESAGAEGSSSSTKASSHVFQALQYLLKLCSHPLLVINDKVLDSLSTILSGVLPGVSDIISELHNLHHSPKLVALQEILEECGIGVDASGSEGSVNVGQHRVLIFAQHKAFLDIIERDLFQTHMKNVTYLRLDGSVEPEKRFDIVKAFNSDPTIDV
LLLTTHVGGLGLNLTSADTLVFVEHDWNPMRDHQAMDRAHRLGQKKVVNVHRLIMRGTLEEKVMSLQRFKVSVANAVINAENASMKTMNTDQLLDLFASADTSKKGSSAGKSLENNPEGDAKLVGTGKGLKAILGGLEELWDQSQYTEEYNLNQFLAKLNG 2034
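
The numeric columns in this row can be cross-checked against the sequences themselves. The following is a minimal sketch, not the database's own method: `gc_content` and `codon_count` are hypothetical helper names, and it assumes that GC_content is the fraction of G and C bases in the Cds and that Pep_length counts one position per codon (the reported values satisfy 6102 / 3 = 2034). Only the first 30 bases of the Cds are used here to keep the example short.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T bases of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / acgt


def codon_count(cds_length: int) -> int:
    """Return the number of codons in a CDS whose length is a multiple of 3."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3


# First 30 bases of the Cds column for Arst3g01354 (codons for MAQQSSRLQR).
cds_prefix = "ATGGCCCAACAGTCATCTAGGCTTCAGCGG"

print(round(gc_content(cds_prefix), 4))  # GC fraction of the prefix only
print(codon_count(6102))                 # 2034, matching Pep_length
```

Running `gc_content` on the full Cds string (with the line breaks removed) should reproduce the GC_content column if the assumption about its definition holds.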

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Arst3g01354 2034 SUPERFAMILY ARM repeat 6 1478 IPR016024 -
Arst3g01354 2034 SUPERFAMILY P-loop containing nucleoside triphosphate hydrolases 1423 1670 IPR027417 -
Arst3g01354 2034 CDD SF2_C_SNF 1785 1927 - -
Arst3g01354 2034 CDD DEXHc_Mot1 1438 1670 IPR044078 GO:0005524
Arst3g01354 2034 SMART helicmild6 1830 1916 IPR001650 -
Arst3g01354 2034 PANTHER TATA-BINDING PROTEIN-ASSOCIATED FACTOR 172 4 2033 IPR044972 GO:0003677|GO:0016887|GO:0017025
Arst3g01354 2034 PANTHER TATA-BINDING PROTEIN-ASSOCIATED FACTOR 172 4 2033 IPR044972 GO:0003677|GO:0016887|GO:0017025
Arst3g01354 2034 Gene3D - 1680 1992 IPR027417 -
Arst3g01354 2034 ProSiteProfiles Superfamilies 1 and 2 helicase C-terminal domain profile. 1806 1962 IPR001650 -
Arst3g01354 2034 SUPERFAMILY P-loop containing nucleoside triphosphate hydrolases 1671 1972 IPR027417 -
Arst3g01354 2034 Gene3D - 1411 1669 IPR038718 -
Arst3g01354 2034 Pfam SNF2-related domain 1456 1752 IPR000330 GO:0005524|GO:0140658
Arst3g01354 2034 MobiDBLite consensus disorder prediction 248 267 - -
Arst3g01354 2034 SMART ultradead3 1434 1631 IPR014001 -
Arst3g01354 2034 FunFam TATA-binding protein-associated factor BTAF1 516 696 - -
Arst3g01354 2034 FunFam B-TFIID TATA-box-binding protein-associated factor 1 1408 1669 - -
Arst3g01354 2034 Gene3D - 1172 1396 IPR011989 -
Arst3g01354 2034 FunFam TATA-binding protein-associated factor 172 1680 1997 - -
Arst3g01354 2034 ProSiteProfiles Superfamilies 1 and 2 helicase ATP-binding type-1 domain profile. 1450 1620 IPR014001 -
Arst3g01354 2034 Pfam Domain of unknown function (DUF3535) 794 1221 IPR022707 -
Arst3g01354 2034 Gene3D - 515 692 IPR011989 -
Arst3g01354 2034 FunFam TATA-binding protein-associated factor BTAF1 1 96 - -
Arst3g01354 2034 FunFam TATA-binding protein-associated factor BTAF1 1185 1396 - -
Arst3g01354 2034 Gene3D - 4 94 IPR011989 -
Arst3g01354 2034 MobiDBLite consensus disorder prediction 251 267 - -
Arst3g01354 2034 Pfam Helicase conserved C-terminal domain 1814 1916 IPR001650 -
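
The Start and End columns can be used to compute domain spans and to check them against the protein length. The sketch below hard-codes the three Pfam rows from the table above. It assumes the coordinates are 1-based and inclusive, which the table itself does not state, and `Hit`, `span_length`, `within_protein`, and `overlaps` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    analysis: str
    description: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


PROTEIN_LENGTH = 2034  # Length column for Arst3g01354

# The three Pfam rows copied from the annotation table.
HITS = [
    Hit("Pfam", "SNF2-related domain", 1456, 1752),
    Hit("Pfam", "Domain of unknown function (DUF3535)", 794, 1221),
    Hit("Pfam", "Helicase conserved C-terminal domain", 1814, 1916),
]


def span_length(hit: Hit) -> int:
    """Number of residues covered by the hit under inclusive coordinates."""
    return hit.end - hit.start + 1


def within_protein(hit: Hit, length: int) -> bool:
    """True if the hit lies entirely inside a protein of the given length."""
    return 1 <= hit.start <= hit.end <= length


def overlaps(a: Hit, b: Hit) -> bool:
    """True if two hits share at least one residue."""
    return a.start <= b.end and b.start <= a.end


for hit in HITS:
    print(hit.description, span_length(hit), within_protein(hit, PROTEIN_LENGTH))
```

The same helpers apply to any other row of the table once its Start and End values are copied into a `Hit`.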

Duplication type information


Gene Chromosome Start End Duplicated_type
Arst3g01354 Arst-Chr3 11541359 11559279 Dispersed/Transposed

Functional genes information


Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Arst3g01354 1433 1971 RAD5 or RAD16-like Gene Family AT1G02670 26.316 4.90e-43 167

Transcription factors information


Regulatory Factors Family Gene Hmm_acc Hmm_name E_value Clan
TR SNF2 Arst3g01354 DUF3535 1.1e-95 No_clan

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst3g01354 K15192 - gmx:100777977 3353.92