Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Arst1g04313 ATGGGACACAAGAAGCGAAATTCGGCTCCGCGGTCCAAACATTCGCCGGCGACTTCTCCGGCGGCGCAATTCGCCGTCGTCGGCGACGGTATCGCATCGCCGGAGAAACAAGACTCAGGCAATGTTCCCGATCACAACAGCAACACCAATAACGGCGTTCAGAACCCTAACGGGATCGAGTTTCCTCCTCCGCCACCGCCGCCGCCTCCTCCGCCTCAATCCGATGGTCCTTCCTCCGAGTACACCGCGATCAAGCTCGAGTGTGAGCGTGCCCTAACGGCGTTCCGGCGCGGGAACCACAACAGAGCACTGAAGCTCATGAAGGAGCTATGCGGGAAACAAGAGAACGCTGCTCACTCCGCATTTGCGCACCGCGTCCAGGGATTCGTCGCCTTTAAGGTCGCCGGAATCTTGAACGACCCTACCGCGAAGCAACGGCACGTGAAGAACGCCGTCGATTCGGCTCGCAAGGCCGTCGAGCTGTGCCCTAATTCGGTCGAGTACGCGCACTTCTACGCCAATTTGATGCTCGAGGTCGCGAACGACGGGAAGGATTATGAGGAGGTAGTGCAGGAGTGCGAGAGGGCGCTGGCGATAGAGAACCCTAGCGATCCAGGGAAGGAGAGCTTCCAGGATGAGAGTGATCAGAAGTTGTCGACTCCCGAGGCACGTGTTTCGCATGTGCAAAACGAGCTGCGGCAGCTGATTCAGAAGTCGAACATCGCTTCGCTGTCGAGCTGGGTGAAGAATCTGAGCAACGGCGAGGAGAGATTTCGGCTGATTCCGATTCGGAGGCCGACGGAGGACCCTATGGAGCTGAGACTGGTTCAGACTCGAAGGCCTAACGAGATCAAGAAAGTGACGAAGACGCCGGAAGAGAGGAGGAAGGAGATTGAAGTGAGAGTCGCCGCTGCGAGGCTGTTGCAGCAGAAGTCTGAGTCGCCCCAATCTCCGAACGAAGTAGATAAGGATGAGAGGGGGCTGGATTCATCTTCAGCGACTGGTCAGAGTAGGAGGAGGCATGTGAATTCGAAGAAGAATGGGTCTACTGCTGAGAGGAGGAAATGGGTGCAATCTTACTGGAGTTCGATGAGTGTGGAAATGAAGAAGGACTTGCTTAGGGTTAGGGTTTCTGATCTTAGGTCACACTTTGGGTCTTCGAAGGATACTTTGCCTATTGATCTTTTGGCGGAGGCTGTGTCGTATGCTGAGGCCAATAAGACATGGAAATTCTGGGTTTGCTGCCATTGTGATGAGAAATTTAACGATGACGAGTCTCACAGCCAACATCTTGCGCAGGCGCACATGGGTATGCTCTCGCCGAATTTGCAGGGGCATCTGCCCCATAATGTTGACAGTGAGTGGATTGATATGATTCATAATTGTTCTTGGAAGCCTCTGGATGTTTCTGCTGCAGTTAAAATGCTTGAGAGTAGAATGAAATTCAAAGGTTCAACATTAGTTGAGGATTCGTACTTCGATCATCATACACAGGACTACTATGACTGTATTAATGATGCAACTGATCCTTACCATGAGAAAGAAGGTTTGGGATACAGTCTTCATAACTGTACAACAGAAAGCAATAACTATTGTAAAGCTGTTGCAAGTAATGTGAGAGAAGGCGTTGAAAACCAATTATCTATGTCGCATCCTTTCGCTGATAGTTGGCCAGTATCTGATGATTCTGAGCGCGCAAAACTTCTGGAGAAAATTCGTGCAGTATTTGAGATGCTTATTAAGCATAAATATCTTGCTGCTAGTCACCTTAGCAAGGTTATACAATTTACTATGGGTGAGATTCAAGGTCTTGCTGCTGGTTCTCAATTTCTAAACCATGGTGTCGACCAAACGCCAATGTGCATATGCTTTCTGGGGGCATCACAGCTTAAGAAAATTCTCCAATTTCTTCAAGAGCTATCTCATGCATGCGGATTGGGTAGATATGGTGATAAAGGTAATGGTCTCATGAATGAGTTTCATGATAT
CAATCAAGGTCCTGAGATCAAAGAGAATATTGTTCTCAATGGAGATTCATCATGCCTCCTTCTGGATGAGTGTTTACTGGTTACACAAGTCACTTTTGATGCTGCTCAGGGGACTGTATTGGATAATATGACTGCCCAAAGTTCTCATGATGGTATTTCAAGCGATAATGATGATTTTCTATCCTGGATATATTCAGGTTCAGCTATAGGGGATCAATTGACATCATGGATGCGAACCAAAGAAGATAATAAACACCAAGGTACAGAAATTATCAAGATGCTTGATAAGGAGTTTTATCAACTACAGACCCTATGTGAGAAGAAGTCTGACCGAATGAGTTATGAGGAAGCACTGCAGACAGTAGAGGATCTTTGTCTTGAAGAGGGAAAGAAGAGGGAAATTGTTGGTGACTTTGTCCAGCAAAGCTATGAGTCTGTCTTAAGAAAACGAAGAGAAGAGCTCATTGAAAGTGAGAATGATGTGATGAATGTCGGCAATAGGTTTGAGTTGGATGCCATATCAAATGTTTTGCAAGAAGCAGAATCAATGAATGTTAATCAATTTGGATATGAAGAAACTTATGCTGGCGTGACTGCTCAGTTATGTGACTTGGAATCTGGTGAAGAAGAAGAATGGAGAATGAAAGACTACTTGCATCAAATGGATGGTTGTATAGAAATTGCTATTCAGAAACTGAAAGAGCACTTGTCTATAGAGCTTAGCAAAATTGATGCTCGAATCATTAGAAATATTGCTGAGATGCAACAATTCGAACTCAAGCTTGGGCCTCTTTCCGCTTATGATTATCGGGCTATATTATTGCCTCTAGTGAAGTCATACCTAAGGGCACGTTTAGAAGAATTTGCCGAGAAGGATGCAGTAGAGAAGTCTGATGCTGCGAGGGAAGCATTCTTGGCTGAACTTGCACGTGATGCTAAGAAGGCTAAAGGGGGAAGTGAGAACACAAGAAATGTGGATAAGACCAAAGATAAGAAGAAGACTAAAGATCACAGAAAAACAAAAGATCTGAAGGCTTCAAGTGGTCATGAGGAGCTATTGCTGCAAGCTAGCAGTCCTGATTCCAATACTGTTGCACCTGATAGTTACTTTCAAGATCCTGAGCTTGTTTCCATGAATGATAATTACTTGGAACAACAGGAAGAGGAATATAGACGGAAAATAGAGCTAGAAGAGGAGGAAAAAAAGCTTGAGGAAACTTTAGAATTTCAGCGTAGGATAGAAAATGAAGCCAAACAAAAACACCTTGCCGAATTACAAAAGAAATCGTCTGGGATATGTTTAGAGGAAGTTGCGGACAAAATTCAGGATGCTCAGTTGAAGACAGTTGCTGATGGGCCAGATGTGCATGATCATGTAAAACTGCCTATACAGTCAGCTGATGAGAATTGCTGTCCTAGTGAAGTGGATAGTGTGATAGTCACCACTAAAAATGGTTCTTTGGTGCCAAATAAATATTCAGTTGATTCAGCTGATCAAAAGATATTGCATCAGCCAACTGTTAAACAAGCAGGTATACCTAATGGAGTTGTTCCAGAGAATGGTCATCAGTTGCCTGATCGTCGTGCAGGGAAAAAGCATAAACGGCATAGGAATTCTTCCAAAATGGTTGATGGAAAATTGGAATCTGTCTCATTGGAAAAGAATATTGAGGATGCACATACTGACAGACATTCAAGAGAGCATGTTAAATTCCATAATGATCAAGATGCAAACAATGGGTGGGAAAGCAATGTATCAAAGGCGAAGAAAGATCTGCAAATGGAAGATGAGGAGGAGGAAAGATTCCAAGCTGATCTTAAAAAGGCTGTACGACAAAGCCTGGACACATATCAAGCACGTGGAAAACTGCCTTTGGATTCTAGTTTAAGAATGTCTCAGAGATCTGCTTCACAAGTAGATTCATTGGGTTTTCCAACACAGAAAGACTCAACTGAGGATGCAAATGGAACTACATTGCTTGGTACTGGGCTAAAGAATG
AAGTTGGTGAATATAACTGTTTTCTCAATGTTATTATACAGTCTTTATGGCATTTAAGACGCTTTCGGGAGGAATTTCTTGGCAGATCAAGATCAGAGCATGATCATGTTGGCAATCCTTGTGTCGTCTGTGCATTGTATGAGATCTTCACTGCTTTGGACCTTGCATCAAAGGACTCAAGGAGAGAAGCAGTAGCACCTACTTCCCTGCGAATAGCTCTAAGCAACCTATATCCAGATAGTAACTTCTTCCAGGAGGCTCAGATGAACGATGCTTCTGAGGTACTTGCAGTGATATTTGACTGCCTTCATCGTGCATTTACCCGTGGTTCAAGTGTTTCTGACACTGAGTCGGTGGAAAGTAATTGCATGGGATCTTGGGATTGTGCAAATAATACTTGTATAGCACATTCACTTTTCGGAATGGACATTTTTGAGCAAATGAACTGCTATCACTGTGGTCTCGAGTCCAGACATTTGAAGTATACATCCTTCTTTCACAATATAAATGCCAACGCATTACGAACAATGAAGGTTATGTGTTCTGAAAGTTCCTTTGATGAGCTATTGAACCTTGTGGAGATGAACCATCAATTGGCTTGTGATCCAGAAGTTGGTGGTTGTGGCAAGCTTAACTACATCCATCACTTTCTTTCAACTCCACCTCATGTTTTTATGACAGTTCTTGGTTGGCAAAATACATGCGAGAGTGCTGATGATATAACAGCCACTGTGGCAGCCCTGAGCACTGCACTAGACATCAGTGTCCTATATCGAGGCTTAGATCCTAAAAGAACTCATAGCTTGGTATCAGTGGTTTGCTACTATGGCCAACATTATCATTGCTTTGCTTACAGTCATGACCATGATCAATGGATTATGTATGATGACAAGACTGTCAAGATAATTGGTGGATGGGCAGATGTTCTTACAATGTGTGAAAAAGGACATCTACAACCTCAGGTTCTTTTCTTTGAAGCTGTAAACTAG 4977 0.4416 
MGHKKRNSAPRSKHSPATSPAAQFAVVGDGIASPEKQDSGNVPDHNSNTNNGVQNPNGIEFPPPPPPPPPPPQSDGPSSEYTAIKLECERALTAFRRGNHNRALKLMKELCGKQENAAHSAFAHRVQGFVAFKVAGILNDPTAKQRHVKNAVDSARKAVELCPNSVEYAHFYANLMLEVANDGKDYEEVVQECERALAIENPSDPGKESFQDESDQKLSTPEARVSHVQNELRQLIQKSNIASLSSWVKNLSNGEERFRLIPIRRPTEDPMELRLVQTRRPNEIKKVTKTPEERRKEIEVRVAAARLLQQKSESPQSPNEVDKDERGLDSSSATGQSRRRHVNSKKNGSTAERRKWVQSYWSSMSVEMKKDLLRVRVSDLRSHFGSSKDTLPIDLLAEAVSYAEANKTWKFWVCCHCDEKFNDDESHSQHLAQAHMGMLSPNLQGHLPHNVDSEWIDMIHNCSWKPLDVSAAVKMLESRMKFKGSTLVEDSYFDHHTQDYYDCINDATDPYHEKEGLGYSLHNCTTESNNYCKAVASNVREGVENQLSMSHPFADSWPVSDDSERAKLLEKIRAVFEMLIKHKYLAASHLSKVIQFTMGEIQGLAAGSQFLNHGVDQTPMCICFLGASQLKKILQFLQELSHACGLGRYGDKGNGLMNEFHDINQGPEIKENIVLNGDSSCLLLDECLLVTQVTFDAAQGTVLDNMTAQSSHDGISSDNDDFLSWIYSGSAIGDQLTSWMRTKEDNKHQGTEIIKMLDKEFYQLQTLCEKKSDRMSYEEALQTVEDLCLEEGKKREIVGDFVQQSYESVLRKRREELIESENDVMNVGNRFELDAISNVLQEAESMNVNQFGYEETYAGVTAQLCDLESGEEEEWRMKDYLHQMDGCIEIAIQKLKEHLSIELSKIDARIIRNIAEMQQFELKLGPLSAYDYRAILLPLVKSYLRARLEEFAEKDAVEKSDAAREAFLAELARDAKKAKGGSENTRNVDKTKDKKKTKDHRKTKDLKASSGHEELLLQASSPDSNTVAPDSYFQDPELVSMNDNYLEQQEEEYRRKIELEEEEKKLEETLEFQRRIENEAKQKHLAELQKKSSGICLEEVADKIQDAQLKTVADGPDVHDHVKLPIQSADENCCPSEVDSVIVTTKNGSLVPNKYSVDSADQKILHQPTVKQAGIPNGVVPENGHQLPDRRAGKKHKRHRNSSKMVDGKLESVSLEKNIEDAHTDRHSREHVKFHNDQDANNGWESNVSKAKKDLQMEDEEEERFQADLKKAVRQSLDTYQARGKLPLDSSLRMSQRSASQVDSLGFPTQKDSTEDANGTTLLGTGLKNEVGEYNCFLNVIIQSLWHLRRFREEFLGRSRSEHDHVGNPCVVCALYEIFTALDLASKDSRREAVAPTSLRIALSNLYPDSNFFQEAQMNDASEVLAVIFDCLHRAFTRGSSVSDTESVESNCMGSWDCANNTCIAHSLFGMDIFEQMNCYHCGLESRHLKYTSFFHNINANALRTMKVMCSESSFDELLNLVEMNHQLACDPEVGGCGKLNYIHHFLSTPPHVFMTVLGWQNTCESADDITATVAALSTALDISVLYRGLDPKRTHSLVSVVCYYGQHYHCFAYSHDHDQWIMYDDKTVKIIGGWADVLTMCEKGHLQPQVLFFEAVN 1658
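
As a consistency check on the table above, the length columns can be related to the sequences by simple arithmetic. The following is a minimal Python sketch, not the database's own method. It assumes that GC_content is the fraction of G and C bases in the Cds, and that the Cds includes its terminal stop codon (the sequence above ends in TAG), in which case 4977 / 3 - 1 = 1658 agrees with Pep_length. The function names and the toy sequence are illustrative only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gc_content(cds: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Assumption: this is how the GC_content column is defined.
    """
    seq = cds.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def expected_pep_length(cds: str) -> int:
    """Return the number of amino acids encoded by a CDS.

    A trailing stop codon, if present, is not counted.
    """
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = len(seq) // 3
    if seq[-3:] in STOP_CODONS:
        codons -= 1
    return codons


# Toy input: the first 12 bases of the Cds above followed by its TAG stop codon.
toy_cds = "ATGGGACACAAGTAG"
print(round(gc_content(toy_cds), 4))  # 0.4667
print(expected_pep_length(toy_cds))   # 4
```

Applied to the full 4977-base Cds, expected_pep_length would return 1658 under the stated assumption about the stop codon.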
       

Annotation information


Seq ID       Length  Analysis         Description                                        Start  End   IPR        GO
Arst1g04313  1658    SUPERFAMILY      Cysteine proteinases                               1325   1655  IPR038765  -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      1      79    -          -
Arst1g04313  1658    Gene3D           Cysteine proteinases                               1307   1656  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      206    220   -          -
Arst1g04313  1658    CDD              Peptidase_C19                                      1416   1655  -          -
Arst1g04313  1658    Pfam             Protein of unknown function (DUF627)               90     201   IPR006866  -
Arst1g04313  1658    Gene3D           Tetratricopeptide repeat domain                    81     212   IPR011990  GO:0005515
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      1291   1317  -          -
Arst1g04313  1658    Pfam             Ubiquitin carboxyl-terminal hydrolase              1326   1633  IPR001394  GO:0004843|GO:0016579
Arst1g04313  1658    Pfam             Protein of unknown function (DUF629)               357    889   IPR006865  -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      307    351   -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      978    994   -          -
Arst1g04313  1658    Coils            Coil                                               1255   1275  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      1014   1028  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      1173   1203  -          -
Arst1g04313  1658    ProSitePatterns  Zinc finger C2H2 type domain signature.            414    435   IPR013087  -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      1288   1317  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      58     76    -          -
Arst1g04313  1658    Coils            Coil                                               1042   1083  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      200    221   -          -
Arst1g04313  1658    PANTHER          UBIQUITIN SPECIFIC PROTEINASE                      1      1657  -          -
Arst1g04313  1658    PANTHER          UBIQUITIN SPECIFIC PROTEINASE                      1      1657  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      978    1028  -          -
Arst1g04313  1658    MobiDBLite       consensus disorder prediction                      39     57    -          -
Arst1g04313  1658    ProSiteProfiles  Ubiquitin specific protease (USP) domain profile.  1325   1657  IPR028889  -

Duplication type information


Gene         Chromosome  Start      End        Duplicated_type
Arst1g04313  Arst-Chr1   105802981  105812874  Dispersed/Tandem

Functional genes information


Gene         Gene_start  Gene_end  Function                          Ath_gene   Identity(%)  E-value  Score
Arst1g04313  84          1655      C2H2 Transcription Factor Family  AT3G47890  56.828       0.0      1702

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst1g04313 - - adu:107493676 3269.94