Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Arst4g01542 ATGGTCCCTCCTCATCTATGGATATGGCAATATATGGGCAGAATCAGTCTCTTGACAATGCTGCTCCTTTGGCTGTTAACTCTGGCTCGCAGTCAGTTCCGTATTCACATGGCAATTACTGCATTAATGAACCTAACCACCAATGTTCAAAGTCAGAGTTGTTCAACATTAGGTGAAGAACGTTGTTGGCTTTATGAGGATGAAAAGGGCATGAAACACGGTCCACATTCTATTTCAGAATTGATTTCCTGGCATCATCATGGATATCTTCAAGACTCGTCAATGATATCTCATTTTGGCAATAAGTATGGTTCCTTTCTGCTAGTTGCTGCTGTAAATGCTCTGAAAGGGGATACATCTGGAACCATTTGTAGATCAGGTTCCAACAACAATGAGCTTGGTGGTACGGTAAACTTAATCTGTGAAATATCTGAGGACATTTCTTCCCAATTGCACTTGGGTGTCATGAAATCTGCCCGCAGAGTTGTGCTCGATGGGGTTATAAGTGATATTATTGCAGAATTTTTTACTGAAAAGAAACACAAGAGGCAGAAGCTTGATTCAACTAACCTGGCTTCTGAGACCTCCGTGGTTGACAGCAAAAGGTCAAATGTTGCTGTGGAGATTAGTAAAGATACTGCTGTTCCGAGTGAGCATGCATCTTCCCATATTGCAGATGATCAGACTTGTCTTGAAATTTCTAGACCATCTTCAACAAGTTTTAAATCGGTTGGAAGTGTTGAGAACTTTTGGTGGTCGTATGCTGTTGTACGTAAAGTACTTGTTGATTATTGCATGCAAGTCACGTGGAATGCTGTATTTTTTGACCCTCTAGCAGAGTACTTATGTTCCTGGAGGAAAAAGAAACTTTGGTCTCATCCTAATCTTCAAATATTTGTTAATGGTTGTGGAGAATATGATGGAAAGATTAAATCTGAAGCTTTGCTTCTTGGGACAGGTTGTTCTAAATACCCTACCGATGGCTGTAATCAGTTTGGAGTGCTGACAACAGGAACAGATTCTCATTCTAAGTTACCTTCTTTATCTTCTAATGTACCCAAAGATGGAAATTTAATGGAAGGTCAAAGAGTTTCATGCACTTATAGTAACTCCAAAAATTTGACATGCATTATTGAAAGTGTGGAGAATGAGCTTCACTTCTCTTCAAAGGTGTCTTTAGCTGAATACTTTCGAAGTTTTGTTGAAAAGGAAGTGAACAAAATAATTCATTCCCCTCATGAAGACAAATTGAGTAAGGTTGCTGTTAGTGTCTCTGGTTTCTCAGAACTACACACTGGTGAAACTCCTATGAAGGAAATTCTAAATGACAAGTTAGTAGCTACTGTTAAGGCTGAAGATTCAGTTTGTGAGCCTTCATTAGCAAATCATATGTCTAATGTATTCTCGAATGCATTTGAGGAGTTGTGTGGAGGTGTAGAAGTAGTTGATGAAGGGGAAATTGGTGAACTTCCACCTGGATTTGAAGAGAACTCGCACACAATTTTCCCACCACCGAATTTGAAGATTCGACCTTCAAGGGTAGCTGAATGTAATCCTAAGATTACAGAATATGTTGCAACTGCACTGTGCCGACAAAAGTTGCATAATGAAGTCCTAGAAGAGTGGAAATCCGCTTTTTTTTATCCTACATTGAATCAAGCTTTTATGTCTAATAAGAAACGCAGTCATTCTGGTATTCATGAGAAAGGAAAAGCAAAGAAAGCAAGGAAGGAACCTTTAAATGATGCTACTTCTGGACTGGGAAAGATGAAAGGAGGAGCAAAAGGCTCTTCTGCAGTTCCTCTAGTTAATGGAAAGTATACATACCACCGCAAGAAACTGTCACACAAAGAGTTGTGTTCCTCTCAATCTGCTTCAGTGGATGATTCGAGGCCAGGGAAGCAGAACGTGTGTAAGTTAAGGAATTATGTTTTGGGAGATTTGAATGAAACTGCAGAAGTCAAAATAGCTGCTAAGCGTGGAAAGGC
TAGTGTGGTTAAAGGGAAGAAAGATAAATCTAATAAGAGCAGGTCATCTATCATTGTCAATGGTAGCACAGATGGTGATCGATTGTCCTTGAAGAATAAAACTAGTTCGAAAGCATTGAAACTTTCACATACTGATGGTGTTGTGGATGCCGTAAAATCTAATGAAAGGAGGCTTTCTGCGTCAACAAATAATAGTGTTGGAATGAAGAAGGTGGTTAAAAGCAATGCTGAGGATGTCCTAAAATCAAATGAAAAGAAGCTTTCAGCATCAACAAATAATAGTGTTTCCATGAAGAAGGTGGCTAAAAGCAATTGCAGTGATGGCACCATTAAGGGGAAGGCTGCTGGTCATTGCTCCAAACAGAGACCAAGTGCAAATAAAATGTCGAAACAAAAAAGGAAACATTCAACAGATGGTATGCCATCCTTACATCCTGCCAAGTCTTTGAAAATTTCAAACGATGGTGCAAAGCATGGAGCAAGCAAACATGCTACTATTGCAAGGAGGAATTCTGCCAAATCCAAGCCATTGAATTTATGTCCTAGATCTGATGGATGTGCTAGGACTTCCATTGATGGGTGGGAATGGCATAGATGGTCTCAAAGTGCTACTCCTGCATATAGAGCACGTGTCAGGGGCATTGCCTGTATACAAAACAAGTGCATAGATTCAGATAATAATTTATTGCAGCTATCGAATAATAAGGGTCTTTCTGCAAGAACAAATAGGATGAAATTGCGCAACCTTCTTGCTGCTGCCGAGGGTGCAGATCTTCTGAAAGTGCCTCAATTGAAGGCAAGGAAAAAACGATTACGTTTTCAAAGGAGCAAGATACATGACTGGGGTCTTGTTGCAATGGAGCCTATAGAGGCAGAGGACTTCGTGATTGAATATGTTGGAGAACTAATTCGTCCTCGGATATCTGATATCCGTGAACGTCAGTATGAGAAGATGGGAATTGGAAGCAGTTACCTTTTCAGACTTGATGATGGTTATGTGGTTGATGCTACAAAGAGAGGTGGGATTGCAAGATTTATTAACCATTCTTGTGAGCCCAACTGCTATACAAAGGTCATCTCTTTCGAGGGTCAAAAGAAGATTTTCATATATTCAAAACGGCATATTGCTGCTGGTGAAGAGATTACTTATAATTATAAGTTCCCGTTGGAGGATAAAAAGATTCCCTGCAACTGTGGTTCCAGAAAGTGTCGTGGATCACTTAATTAG 3216 0.3986 
MVPPHLWIWQYMGRISLLTMLLLWLLTLARSQFRIHMAITALMNLTTNVQSQSCSTLGEERCWLYEDEKGMKHGPHSISELISWHHHGYLQDSSMISHFGNKYGSFLLVAAVNALKGDTSGTICRSGSNNNELGGTVNLICEISEDISSQLHLGVMKSARRVVLDGVISDIIAEFFTEKKHKRQKLDSTNLASETSVVDSKRSNVAVEISKDTAVPSEHASSHIADDQTCLEISRPSSTSFKSVGSVENFWWSYAVVRKVLVDYCMQVTWNAVFFDPLAEYLCSWRKKKLWSHPNLQIFVNGCGEYDGKIKSEALLLGTGCSKYPTDGCNQFGVLTTGTDSHSKLPSLSSNVPKDGNLMEGQRVSCTYSNSKNLTCIIESVENELHFSSKVSLAEYFRSFVEKEVNKIIHSPHEDKLSKVAVSVSGFSELHTGETPMKEILNDKLVATVKAEDSVCEPSLANHMSNVFSNAFEELCGGVEVVDEGEIGELPPGFEENSHTIFPPPNLKIRPSRVAECNPKITEYVATALCRQKLHNEVLEEWKSAFFYPTLNQAFMSNKKRSHSGIHEKGKAKKARKEPLNDATSGLGKMKGGAKGSSAVPLVNGKYTYHRKKLSHKELCSSQSASVDDSRPGKQNVCKLRNYVLGDLNETAEVKIAAKRGKASVVKGKKDKSNKSRSSIIVNGSTDGDRLSLKNKTSSKALKLSHTDGVVDAVKSNERRLSASTNNSVGMKKVVKSNAEDVLKSNEKKLSASTNNSVSMKKVAKSNCSDGTIKGKAAGHCSKQRPSANKMSKQKRKHSTDGMPSLHPAKSLKISNDGAKHGASKHATIARRNSAKSKPLNLCPRSDGCARTSIDGWEWHRWSQSATPAYRARVRGIACIQNKCIDSDNNLLQLSNNKGLSARTNRMKLRNLLAAAEGADLLKVPQLKARKKRLRFQRSKIHDWGLVAMEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISFEGQKKIFIYSKRHIAAGEEITYNYKFPLEDKKIPCNCGSRKCRGSLN 1071
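
As a sanity check on the Cds_length, GC_content and Pep_length columns, the following Python sketch shows how such values could be recomputed from a sequence string. The function names `gc_content` and `expected_pep_length` are illustrative and not part of the database. The sketch assumes the CDS includes a terminal stop codon (the CDS above ends in TAG), so the peptide has one residue fewer than the codon count: 3216 / 3 - 1 = 1071.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def expected_pep_length(cds_length: int, has_stop: bool = True) -> int:
    """Residues encoded by a CDS, not counting a terminal stop codon if present."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if has_stop else codons


# Values reported in the table for Arst4g01542.
cds_length, pep_length = 3216, 1071
print(expected_pep_length(cds_length) == pep_length)

# GC fraction of a short fragment (the first 12 bases of the CDS only).
print(round(gc_content("ATGGTCCCTCCT"), 4))
```

To check the reported GC_content of 0.3986, pass the full CDS string (with any line breaks removed) to `gc_content`.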
       

Annotation information


Seq ID       Length  Analysis         Description                               Start  End   IPR        GO
Arst4g01542  1071    Gene3D           -                                         46     130   IPR035445  -
Arst4g01542  1071    MobiDBLite       consensus disorder prediction             771    835   -          -
Arst4g01542  1071    ProSiteProfiles  SET domain profile.                       932    1049  IPR001214  GO:0005515
Arst4g01542  1071    ProSiteProfiles  GYF domain profile.                       60     116   IPR003169  GO:0005515
Arst4g01542  1071    SMART            set_7                                     932    1055  IPR001214  GO:0005515
Arst4g01542  1071    MobiDBLite       consensus disorder prediction             559    595   -          -
Arst4g01542  1071    Gene3D           SET domain                                862    1071  IPR046341  -
Arst4g01542  1071    SUPERFAMILY      SET domain                                927    1068  IPR046341  -
Arst4g01542  1071    PANTHER          HISTONE-LYSINE N-METHYLTRANSFERASE SETD1  50     1071  IPR044570  GO:0042800|GO:0051568
Arst4g01542  1071    Pfam             SET domain                                944    1048  IPR001214  GO:0005515
Arst4g01542  1071    ProSiteProfiles  Post-SET domain profile.                  1055   1071  IPR003616  -
Arst4g01542  1071    SUPERFAMILY      GYF domain                                50     112   IPR035445  -
Arst4g01542  1071    SMART            PostSET_3                                 1055   1071  IPR003616  -
Arst4g01542  1071    CDD              SET_SETD1                                 920    1067  IPR037841  GO:0042800|GO:0048188
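
The SET-domain hits above come from several member databases and differ slightly in their boundaries. The following is a minimal sketch, assuming 1-based inclusive residue coordinates, of how the region shared by all of them could be computed. The `Feature` tuple and the helper names are hypothetical; the rows are copied from the table.

```python
from typing import NamedTuple, Optional, Sequence, Tuple


class Feature(NamedTuple):
    analysis: str
    description: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed inclusive


# SET-related rows copied from the annotation table.
FEATURES = [
    Feature("ProSiteProfiles", "SET domain profile.", 932, 1049),
    Feature("SMART", "set_7", 932, 1055),
    Feature("Gene3D", "SET domain", 862, 1071),
    Feature("SUPERFAMILY", "SET domain", 927, 1068),
    Feature("Pfam", "SET domain", 944, 1048),
    Feature("CDD", "SET_SETD1", 920, 1067),
]


def overlap(a: Feature, b: Feature) -> int:
    """Number of residues shared by two features (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def consensus_span(features: Sequence[Feature]) -> Optional[Tuple[int, int]]:
    """Residue range covered by every feature, or None if there is none."""
    start = max(f.start for f in features)
    end = min(f.end for f in features)
    return (start, end) if start <= end else None


print(consensus_span(FEATURES))
```

For these six rows the shared region is residues 944 to 1048, which coincides with the Pfam hit.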
       

Duplication type information


Gene         Chromosome  Start     End       Duplicated_type
Arst4g01542  Arst-Chr4   21594594  21603430  Dispersed/Tandem
       

Functional genes information


Gene         Gene_start  Gene_end  Function                          Ath_gene   Identity(%)  E-value   Score
Arst4g01542  931         1071      C2H2 Transcription Factor Family  AT2G23740  34.969       1.77e-17  86.3
       

Transcription factors information


Regulatory Factors  Family  Gene  Hmm_acc  Hmm_name  E_value  Clan
TR SET Arst4g01542 SET 4.1e-20 No_clan
       

Pathway information


Query  KO  Definition  Second KO  KEGG Genes ID  GHOSTX Score
Arst4g01542 - - aip:107636228 1945.63