Gene search


Sequence information


Select Gene Cds Cds_length GC_content Pep Pep_length
Arst4g01541 ATGGAGATGAGCTGCCAAGAAAATGTTGGTGGTAGTGATATTCCTGCATGTTCTACAGCAGGGAATATTTCACATCAAGACCAGCGTTTCAGTAGCTATGTGCAGCAGCCTGCTTTTGTGAGTGGATGGATGTATGTGAATGAACAAGGGCAAATGTGTGGTCCATATATTCAGGAGCAACTATATGAGGGTTTAACTACTGGTTTCTTGCCATTTGAGCTTCCTGTCTATCCTATGATTAATGGCACAATAATGAACCCTGTGCCACTGAATTACTTCAAGCAGTTTCCTGACCATGTCTCTACTGGGTTTGCATATCTGAATTTGGGTTTCTCTGGCTCAGGGGTTCATACAAATGGTCCCTCCTCATCTATGGATATGGCAATATATGGGCAGAATCAGTCTCTTGACAATGCTGCTCCTTTGGCTGTTAACTCTGGCTCGCAGTCAGTTCCGTATTCACATGGCAATTACTGCATTAATGAACCTAACCACCAATGTTCAAAGTCAGAGTTGTTCAACAGTATGATATCTTCTCAGATGTTAGGTGAAGAACGTTGTTGGCTTTATGAGGATGAAAAGGGCATGAAACACGGTCCACATTCTATTTCAGAATTGATTTCCTGGCATCATCATGGATATCTTCAAGACTCGTCAATGATATCTCATTTTGGCAATAAGTATGGTTCCTTTCTGCTAGTTGCTGCTGTAAATGCTCTGAAAGGGGATACATCTGGAACCATTTGTAGATCAGGTTCCAACAACAATGAGCTTGGTGGTACGGTAAACTTAATCTGTGAAATATCTGAGGACATTTCTTCCCAATTGCACTTGGGTGTCATGAAATCTGCCCGCAGAGTTGTGCTCGATGGGGTTATAAGTGATATTATTGCAGAATTTTTTACTGAAAAGAAACACAAGAGGCAGAAGCTTGATTCAACTAACCTGGCTTCTGAGACCTCCGTGGTTGACAGCAAAAGGTCAAATGTTGCTGTGGAGATTAGTAAAGATACTGCTGTTCCGAGTGAGCATGCATCTTCCCATATTGCAGATGATCAGACTTGTCTTGAAATTTCTAGACCATCTTCAACAAGTTTTAAATCGGTTGGAAGTGTTGAGAACTTTTGGTGGTCGTATGCTGTTGTACGTAAAGTACTTGTTGATTATTGCATGCAAGTCACGTGGAATGCTGTATTTTTTGACCCTCTAGCAGAGTACTTATGTTCCTGGAGGAAAAAGAAACTTTGGTCTCATCCTAATCTTCAAATATTTGTTAATGGTTGTGGAGAATATGATGGAAAGATTAAATCTGAAGCTTTGCTTCTTGGGACAGGTTGTTCTAAATACCCTACCGATGGCTGTAATCAGTTTGGAGTGCTGACAACAGGAACAGATTCTCATTCTAAGTTACCTTCTTTATCTTCTAATGTACCCAAAGATGGAAATTTAATGGAAGGTCAAAGAGTTTCATGCACTTATAGTAACTCCAAAAATTTGACATGCATTATTGAAAGTGTGGAGAATGAGCTTCACTTCTCTTCAAAGGTGTCTTTAGCTGAATACTTTCGAAGTTTTGTTGAAAAGGAAGTGAACAAAATAATTCATTCCCCTCATGAAGACAAATTGAGTAAGGTTGCTGTTAGTGTCTCTGGTTTCTCAGAACTACACACTGGTGAAACTCCTATGAAGGAAATTCTAAATGACAAGTTAGTAGCTACTGTTAAGGCTGAAGATTCAGTTTGTGAGCCTTCATTAGCAAATCATATGTCTAATGTATTCTCGAATGCATTTGAGGAGTTGTGTGGAGGTGTAGAAGTAGTTGATGAAGGGGAAATTGGTGAACTTCCACCTGGATTTGAAGAGAACTCGCACACAATTTTCCCACCACCGAATTTGAAGATTCGACCTTCAAGGGTAGCTGAATGTAATCCTAAGATTACAGAATATGTTGCAACTGCACTGTGCCGACAAAAGTTGCATAATGAAGTCCTAGAAGAGTGGAAATCCGCTTTTTTTTATCCTACATTGAATCAAGCTTTTATGTCTAATAAGAAACGCAGTCATTCTGGTATTCATGAGAAAGGAAAAGCAAAGAAAGCAAGGAAGGAACCTTTAAATGATGCTACTTCTGGACTGGGAAAGATGAAAGGAGGAGCAAAAGGCTCTTCTGCAGTTCCTCTAGTTAATGGAAAGTATACATACCACCGCAAGAAACTGTCACACAAAGAGTTGTGTTCCTCTCAATCTGCTTCAGTGGATGATTCGAGGCCAGGGAAGCAGAACGTGTGTAAGTTAAGGAATTATGTTTTGGGAGATTTGAATGAAACTGCAGAAGTCAAAATAGCTGCTAAGCGTGGAAAGGCTAGTGTGGTTAAAGGGAAGAAAGATAAATCTAATAAGAGCAGGTCATCTATCATTGTCAATGGTAGCACAGATGGTGATCGATTGTCCTTGAAGAATAAAACTAGTTCGAAAGCATTGAAACTTTCACATACTGATGGTGTTGTGGATGCCGTAAAATCTAATGAAAGGAGGCTTTCTGCGTCAACAAATAATAGTGTTGGAATGAAGAAGGTGGTTAAAAGCAATGCTGAGGATGTCCTAAAATCAAATGAAAAGAAGCTTTCAGCATCAACAAATAATAGTGTTTCCATGAAGAAGGTGGCTAAAAGCAATTGCAGTGATGGCACCATTAAGGGGAAGGCTGCTGGTCATTGCTCCAAACAGAGACCAAGTGCAAATAAAATGTCGAAACAAAAAAGGAAACATTCAACAGATGGTATGCCATCCTTACATCCTGCCAAGTCTTTGAAAATTTCAAACGATGGTGCAAAGCATGGAGCAAGCAAACATGCTACTATTGCAAGGAGGAATTCTGCCAAATCCAAGCCATTGAATTTATGTCCTAGATCTGATGGATGTGCTAGGACTTCCATTGATGGGTGGGAATGGCATAGATGGTCTCAAAGTGCTACTCCTGCATATAGAGCACGTGTCAGGGGCATTGCCTGTATACAAAACAAGTGCATAGATTCAGATAATAATTTATTGCAGCTATCGAATAATAAGGGTCTTTCTGCAAGAACAAATAGGATGAAATTGCGCAACCTTCTTGCTGCTGCCGAGGGTGCAGATCTTCTGAAAGTGCCTCAATTGAAGGCAAGGAAAAAACGATTACGTTTTCAAAGGAGCAAGATACATGACTGGGGTCTTGTTGCAATGGAGCCTATAGAGGCAGAGGACTTCGTGATTGAATATGTTGGAGAACTAATTCGTCCTCGGATATCTGATATCCGTGAACGTCAGTATGAGAAGATGGGAATTGGAAGCAGTTACCTTTTCAGACTTGATGATGGTTATGTGGTTGATGCTACAAAGAGAGGTGGGATTGCAAGATTTATTAACCATTCTTGTGAGCCCAACTGCTATACAAAGGTCATCTCTTTCGAGGGTCAAAAGAAGATTTTCATATATTCAAAACGGCATATTGCTGCTGGTGAAGAGATTACTTATAATTATAAGTTCCCGTTGGAGGATAAAAAGATTCCCTGCAACTGTGGTTCCAGAAAGTGTCGTGGATCACTTAATTAG 3591 0.4013 MEMSCQENVGGSDIPACSTAGNISHQDQRFSSYVQQPAFVSGWMYVNEQGQMCGPYIQEQLYEGLTTGFLPFELPVYPMINGTIMNPVPLNYFKQFPDHVSTGFAYLNLGFSGSGVHTNGPSSSMDMAIYGQNQSLDNAAPLAVNSGSQSVPYSHGNYCINEPNHQCSKSELFNSMISSQMLGEERCWLYEDEKGMKHGPHSISELISWHHHGYLQDSSMISHFGNKYGSFLLVAAVNALKGDTSGTICRSGSNNNELGGTVNLICEISEDISSQLHLGVMKSARRVVLDGVISDIIAEFFTEKKHKRQKLDSTNLASETSVVDSKRSNVAVEISKDTAVPSEHASSHIADDQTCLEISRPSSTSFKSVGSVENFWWSYAVVRKVLVDYCMQVTWNAVFFDPLAEYLCSWRKKKLWSHPNLQIFVNGCGEYDGKIKSEALLLGTGCSKYPTDGCNQFGVLTTGTDSHSKLPSLSSNVPKDGNLMEGQRVSCTYSNSKNLTCIIESVENELHFSSKVSLAEYFRSFVEKEVNKIIHSPHEDKLSKVAVSVSGFSELHTGETPMKEILNDKLVATVKAEDSVCEPSLANHMSNVFSNAFEELCGGVEVVDEGEIGELPPGFEENSHTIFPPPNLKIRPSRVAECNPKITEYVATALCRQKLHNEVLEEWKSAFFYPTLNQAFMSNKKRSHSGIHEKGKAKKARKEPLNDATSGLGKMKGGAKGSSAVPLVNGKYTYHRKKLSHKELCSSQSASVDDSRPGKQNVCKLRNYVLGDLNETAEVKIAAKRGKASVVKGKKDKSNKSRSSIIVNGSTDGDRLSLKNKTSSKALKLSHTDGVVDAVKSNERRLSASTNNSVGMKKVVKSNAEDVLKSNEKKLSASTNNSVSMKKVAKSNCSDGTIKGKAAGHCSKQRPSANKMSKQKRKHSTDGMPSLHPAKSLKISNDGAKHGASKHATIARRNSAKSKPLNLCPRSDGCARTSIDGWEWHRWSQSATPAYRARVRGIACIQNKCIDSDNNLLQLSNNKGLSARTNRMKLRNLLAAAEGADLLKVPQLKARKKRLRFQRSKIHDWGLVAMEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISFEGQKKIFIYSKRHIAAGEEITYNYKFPLEDKKIPCNCGSRKCRGSLN 1196
       

Annotation information


Select Seq ID Length Analysis Description Start End IPR GO
Arst4g01541 1196 CDD SET_SETD1 1045 1192 IPR037841 GO:0042800|GO:0048188
Arst4g01541 1196 ProSiteProfiles Post-SET domain profile. 1180 1196 IPR003616 -
Arst4g01541 1196 SMART PostSET_3 1180 1196 IPR003616 -
Arst4g01541 1196 SMART set_7 1057 1180 IPR001214 GO:0005515
Arst4g01541 1196 Gene3D - 174 255 IPR035445 -
Arst4g01541 1196 MobiDBLite consensus disorder prediction 900 960 - -
Arst4g01541 1196 PANTHER HISTONE-LYSINE N-METHYLTRANSFERASE SETD1 3 1196 IPR044570 GO:0042800|GO:0051568
Arst4g01541 1196 ProSiteProfiles GYF domain profile. 185 241 IPR003169 GO:0005515
Arst4g01541 1196 SUPERFAMILY GYF domain 180 237 IPR035445 -
Arst4g01541 1196 Gene3D SET domain 987 1196 IPR046341 -
Arst4g01541 1196 MobiDBLite consensus disorder prediction 684 720 - -
Arst4g01541 1196 Pfam SET domain 1069 1173 IPR001214 GO:0005515
Arst4g01541 1196 ProSiteProfiles SET domain profile. 1057 1174 IPR001214 GO:0005515
Arst4g01541 1196 SUPERFAMILY SET domain 1052 1193 IPR046341 -
       

Duplication type information


Select Gene Chromosome Start End Duplicated_type
Arst4g01541 Arst-Chr4 21594594 21604922 Dispersed/Tandem
       

Functional genes information


Select Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Arst4g01541 1056 1196 C2H2 Transcription Factor Family AT2G23740 34.969 2.70e-17 86.3
       

Transcription factors information


Select Regulatory Factors Family Gene Hmm_acc Hmm_name E_value Clan
TR SET Arst4g01541 SET 4.8e-20 No_clan
       

Pathway information


Select Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Arst4g01541 - - aip:107636228 2306.56