Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Gso4g2043 ATGATTTTATCACGGGATGATACAAAAGTTGTGCTAGTTGCATGGTTTCTATTAATGATCTCAGCAATGTCGCAAACAATTCGTCCTAGCTGCCAAAATAAATGTGGAAGTGTGAACATTCCATATCCTTTTGGTACAACTGAGAACTGCTGCTTGAACAGAAACTTTTATGTTGCTTGTAACACAAGTCACAACCCTCCCAAACCATTTCTGTGGAATGTGACAAAAAATATTGAGATTTTAGAGGTGTCGTTGAATGGCCATTTGAGGATAAAATCCCCTGTAGCTTATGTTTGCTATGATGAAAAGGGTGTGCTAGTGGATTCCGGAAATTCATTCATGACTTTGCAAGCATTCCACTTCTCATACTCTCAGAACAAGTTCATTGGAATTGGATGTGACACCCTTTCAACCATCAATGCAACAATTGGGAAAAACTACTCAGCTGGAGGGTGTTTTTCACTTTGTAGTAGTGTAGAAAGTTCAGCCAATGGATCCTGGTTTGGCATTGGTTTCTGCCAAACCTCTATCCCAAAAAACATTTTGGCATATCAAGCTCGTGTCCTGAGATCAAATTTGATGCATAGTGATATGAATATTCCTTGTGCCTACTCTCTTTTGGTTGAAGAGGACTCTTTCAAGTTCTCCACGGATGACTTCATCAAACTTCAGAAGACAAAAACAGCTACCACAGTGCTTGATTGGGCTGTTGGAAATCAGACATGTCAAGAAGCTAAGAAGAATTTAACTAGTTATGCCTGCCAAGCAAATAGTGTATGTATTGATTCTGACAATGGACCAGGGTACCTTTGCAGATGCTTAGAAGGTTATGTGGGAAATGCATACCTTCATGGTGGATGCCAAGATATTGATGAGTGCGCCAATCCGAGTCTAAATGACTGTTCAGATATTTGCCTTAACCTACCTGGGAGCTATAATTGCTCTTGTCCCAAGAGCAAGAGTTATGAGGGAGATGGCAGAAAAGGAGGTAGTGGCTGTGTCTCGAATCTACCACATGTGGTTAATCAGATTGTGATAGGCACTGGCATAGGCCTTATGCTGTTGTTAATTGGTAGTGGCTGGCTATTCCATGTGTTCCGTAAAAGAAAAATGGTGAGACTCACAGCAAGATATTTTAAGAGAAATGGTGGCTTAATGTTGCAGCAGCAAATTGCGAACATGGAAGGATCATCTGAGAGAGCCAAAATTTTCACTGCAACTGAGCTAAAGAAAGCCTCCGAAAACTTTCATGAGAGCAGAATCATTGGCAGAGGAGGATACGGTACGGTTTATAGGGGAATACTTCCAAATGACAAAGTTGTTGCTATCAAGAAATCAAAACTAGTAGACCACAGCCAAATTGAACAATTCATCAATGAAGTGGTTGTACTGTCCCAAATCAACCACAGAAATGTTGTCAAACTCCTAGGTTGCTGTCTAGAGACAGAAATGCCACTACTGGTTTATGAATTTGTGAACAATGGCACCCTCTTTGACCACATTCACAACAAAAACACCACACTTCCCTGGGTAACACGTCTAAGAATAGCAGCAGAAACGGCTGGTGTGCTTGCATACTTGCATTCTGCTGCCTCCATACCAGTCATCCATAGGGACTTTAAGTCAACTAACATACTGTTGGATGACAAATACACAGCTAAGGTTTCTGACTTTGGAACCTCGAGATTGGTTCCACGTGACAAGTGTCAATTAACAACATTGGTTCAGGGAACTCTTGGCTACCTTGACCCTGAGTACTTTCAGACCAGCCAACTAACTGAGAAAAGCGATGTGTATAGCTTCGGTGTCGTGCTTGCGGAGTTGCTAACTGGAAGAAGGGCACTTTCTTTTGACATGCCAGAGGAAGAGAGAAACTTAGCCTTGTATTTCCTTTCAGCTGTGAAGGATGATTGCTTGTTCCAAATTGTAGAGGATTGTGTGAGTGAGGGAAATAGTGAGCAGGTGAAGGAAGTTGCTAACATTGCTCAAT
GGTGCTTGAGGCTTAGAGGTGAGGAAAGACCTACCATGAAGGAAGTGGCAATGGAATTGGATAGTCTTAGAATGATGACAACAACAACTACATGGATTAATGCCGCATCAAATTCAACAGAATATGTGATTGGTGAAAGATCAGGTCGAACAGAAACAACAGACTATGCTAATTGTCATTATACTACATGTGCTGGGCATGAGGATGACACCATTTGCAACGATGTGATGTCACCGTTATGGGATGGTAGAATTTCATGGCCAGTAGCCAGCGACTGCTACGCTGAAAGAGGCAAACTCGTGAGCCAAACATTCCAAGACATAAACCTAACAACTTTTCAAGTCTCTTCGAACCGAAACAAGTTTACTGTAATCGGGTGCGACACGCTGGGAGTAGTGGTGGGAATCGATTCAAAGGGAAGGAACTACACCACGGGATGCGTGTCCCTGTGCAACAGGCTCCAAGACATCGAAACGAACGGGTCTTGTTCCGGCACCGGTTGTTACGAGACTTCGATCCCGCGCGGTTTGTCGGGTTTCTCCTACGGTTCAGGGAGCGTGGTCGACTTCAACTTGTGCGGGCACGCGTTCCTCACGGACCTCGTCAACTTCGACAAGACAACGTTCCCCGTTGGGATGGATTGGGTAGTGAAGAACCAAACGTGCCAAGAAGCTATGAAGAAGATAGGTTACAAGAGCACACAATTGAAAGAAAAGCAGCTGCAAATGGGTGTCCAATATTCAAAGCTGCTGTTGCAGCTGCTACTTGTTGCGTTCTTTATTGCAGCAACTAAAACTGAACCACCTTTGTCAAAGCCAAACTGCCAACAAAAGTGTGGCAACGTTATCATTCCCTTTCCCTTCGGCATGACGGAGGCTTGTTCCCTCAACACCTCATTTCTCATCACTTGCCACCAAAACCTTTCACCACCCACCCCATTCTTGCAAAATTTTTATCAAATTAGCGTGCTAGACATCTCACTCGAGTACGGCCAATTGAACATTTCATTGCCAGTAGCCAGAAACTGCCTCATCAATAATCTAACTGGGGAATCAGTTATTGAAATGAATCTCGGACCTTTTCACCTATCATCCAACCAAAACAAGTTAATTGTGTTTGGTGCTGACGCGGCTGGATTGGTATATAATCTGGAAAATGCATCAGGAATACTGTACCCTACTATTGCATGCATGTCTGTGTACGCTCCTGCAGCCTCAGCACCTGACAAGTCATGCTCTGGCACCTTATGTTGCGAGACTCCAATACAACAAAGGCTATCAGAATTCTTCTATGAATCTTCTACAAATATCTTCCGCAGAAATAACACCAAGAGACTTGAGTCCTATCCATGTGGTTATACATTTCTGGTTAAGGACGGAGCCTACAAGTTCCACATCACAGACATCTTCAACCTTAGCACTAACAACAAATTTCCTGTTGTTGCGGATTGGGCAGTGGGGACTCACACGTGCCAAGATGCTATGAAGAATGCTTCAAGTTATCTGTGTAAGTCAAACTATAGTGAATGTCGCGACGCAGAGGTGGGACCTGGTTACCATTGTAAATGCTACAGTGGTTATCGGGGCAACCCTTACCTCTCTAATGGTTGCCAAGACGTTGATGAATGCAATGAGAAAACCCACAACTGTACAGAGGGATCAATATGCAGCAACAGTCCAGGGATCTACAGTTGTTCTTGTCTAAAAGGATATGAAGGAGATGGAAAAAATAATGGAACAGGATGTCGTCCCAAAGTCAGCAGTAGCCGCATAATCATAATTGCTTTGACTGTGAGTGTAAGTATCTTAACACTTCTTGGGGGAACCTTTTATATGTATTGGACATCGAAGAAGAGAAATCTCATCAGACTTAAAGAGCAATATTTTCAACAAAACGGTGGCCTGTTATTACAACAAGTTGTCAGATATAGTGGGTCAACTGAAATGACTAAAATCTTCACAGTGGAGGAACTAAGTCAGGCCACCAACAATTTCGATGAA
AGCATGGTCTTAGGCCAAGGTGGCCAGGGAACAGTTTACAAAGGAATATTATCTAACAATAGAATTGTAGCAATAAAAATGTCCAGAATTAGTAACCCAAACCAAGTTGAGCATTTCATCAATGAGATGATATTGCTTTCTCAAATCAACCATAGAAATGTGGTGAAACTATTGGGATGTTGTTTAGAGACAGAAGTTCCCTTGCTTGTTTATGAATTTGTTCCCAATGGTACTGTTTACGAGCATCTTCACAATCAAGGTCAATCTTTAAAACTTACATGGAAAACAAGATTGCAAATAGCAACAGAAACTGCCAGGGCCCTGGCATACTTGCATTCTGCTACCAATGCACCAATCATACATAGAGATGTGAAAACTGCAAACATACTACTTGATCACAATCTCACTGCAAAGGTTTCTGATTTTGGAGCTTCAAGGATTATTCCTCTTGATCAAACTCAGTTAACCACTTTGGTGCTGGGGACACTAGGGTATCTTGACCCAGAATACTTCCACTCAAGCCAGTTAACAGAGAAAAGTGATGTCTATAGCTTTGGAGTTGTCTTAGTTGAACTACTGACAGGAAAGAAGGCACTCTCTTTTGAAAGGCCAGAGGCTCATAGAAATCTTGCAGTGCACTTCCATTCTTCAATGCAAGAGGGTCGGTTACTTAACATTGTAGACAGCCACATAATAGATGAGGCAAATGTTGAGCAACTAATGGATGTTGCTAACATTGCAAACCATTGTTTAAGGTTGAAAGGGGAGGAAAGACCCACCATGAAAGACGTGGCAATGGAACTTGAAGGAATAAGCGTTGTGGAGAAGCACCAATGGGAAAAAATCAATTTGTCATCAGAGGAGACTGAAAATTTTCTCAAAGCAATACCATCATCTTCTTTTAGTATTGTAGATGGCGTTAATAGAAGAAGCATTAATTCTGGCTCTAATATTTTAAACCGAATTTCCTTTTCCTTGAGTGGTGGAAGATGA 4983 0.4222 
MILSRDDTKVVLVAWFLLMISAMSQTIRPSCQNKCGSVNIPYPFGTTENCCLNRNFYVACNTSHNPPKPFLWNVTKNIEILEVSLNGHLRIKSPVAYVCYDEKGVLVDSGNSFMTLQAFHFSYSQNKFIGIGCDTLSTINATIGKNYSAGGCFSLCSSVESSANGSWFGIGFCQTSIPKNILAYQARVLRSNLMHSDMNIPCAYSLLVEEDSFKFSTDDFIKLQKTKTATTVLDWAVGNQTCQEAKKNLTSYACQANSVCIDSDNGPGYLCRCLEGYVGNAYLHGGCQDIDECANPSLNDCSDICLNLPGSYNCSCPKSKSYEGDGRKGGSGCVSNLPHVVNQIVIGTGIGLMLLLIGSGWLFHVFRKRKMVRLTARYFKRNGGLMLQQQIANMEGSSERAKIFTATELKKASENFHESRIIGRGGYGTVYRGILPNDKVVAIKKSKLVDHSQIEQFINEVVVLSQINHRNVVKLLGCCLETEMPLLVYEFVNNGTLFDHIHNKNTTLPWVTRLRIAAETAGVLAYLHSAASIPVIHRDFKSTNILLDDKYTAKVSDFGTSRLVPRDKCQLTTLVQGTLGYLDPEYFQTSQLTEKSDVYSFGVVLAELLTGRRALSFDMPEEERNLALYFLSAVKDDCLFQIVEDCVSEGNSEQVKEVANIAQWCLRLRGEERPTMKEVAMELDSLRMMTTTTTWINAASNSTEYVIGERSGRTETTDYANCHYTTCAGHEDDTICNDVMSPLWDGRISWPVASDCYAERGKLVSQTFQDINLTTFQVSSNRNKFTVIGCDTLGVVVGIDSKGRNYTTGCVSLCNRLQDIETNGSCSGTGCYETSIPRGLSGFSYGSGSVVDFNLCGHAFLTDLVNFDKTTFPVGMDWVVKNQTCQEAMKKIGYKSTQLKEKQLQMGVQYSKLLLQLLLVAFFIAATKTEPPLSKPNCQQKCGNVIIPFPFGMTEACSLNTSFLITCHQNLSPPTPFLQNFYQISVLDISLEYGQLNISLPVARNCLINNLTGESVIEMNLGPFHLSSNQNKLIVFGADAAGLVYNLENASGILYPTIACMSVYAPAASAPDKSCSGTLCCETPIQQRLSEFFYESSTNIFRRNNTKRLESYPCGYTFLVKDGAYKFHITDIFNLSTNNKFPVVADWAVGTHTCQDAMKNASSYLCKSNYSECRDAEVGPGYHCKCYSGYRGNPYLSNGCQDVDECNEKTHNCTEGSICSNSPGIYSCSCLKGYEGDGKNNGTGCRPKVSSSRIIIIALTVSVSILTLLGGTFYMYWTSKKRNLIRLKEQYFQQNGGLLLQQVVRYSGSTEMTKIFTVEELSQATNNFDESMVLGQGGQGTVYKGILSNNRIVAIKMSRISNPNQVEHFINEMILLSQINHRNVVKLLGCCLETEVPLLVYEFVPNGTVYEHLHNQGQSLKLTWKTRLQIATETARALAYLHSATNAPIIHRDVKTANILLDHNLTAKVSDFGASRIIPLDQTQLTTLVLGTLGYLDPEYFHSSQLTEKSDVYSFGVVLVELLTGKKALSFERPEAHRNLAVHFHSSMQEGRLLNIVDSHIIDEANVEQLMDVANIANHCLRLKGEERPTMKDVAMELEGISVVEKHQWEKINLSSEETENFLKAIPSSSFSIVDGVNRRSINSGSNILNRISFSLSGGR* 1661
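
The lengths reported in the row above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database itself. It assumes that Pep_length (1661) counts the trailing `*` stop symbol, and that the Length of 1660 used in the annotation table excludes it. The helper `gc_fraction` is a hypothetical name introduced for illustration and is applied only to the first 12 bases of the CDS, not to the full sequence.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the Sequence information and Annotation information tables.
CDS_LENGTH = 4983        # Cds_length
PEP_LENGTH = 1661        # Pep_length (assumed to include the '*' stop symbol)
ANNOTATED_LENGTH = 1660  # Length column of the annotation table

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(CDS_LENGTH, 3)
assert remainder == 0
# One codon per peptide symbol, stop included.
assert codons == PEP_LENGTH
# Dropping the stop symbol gives the annotated protein length.
assert PEP_LENGTH - 1 == ANNOTATED_LENGTH

# Toy input: the first 12 bases of the CDS shown above.
toy = "ATGATTTTATCA"
print(round(gc_fraction(toy), 4))
```

Running `gc_fraction` over the full CDS string would be the way to compare against the reported GC_content of 0.4222.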
       

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Gso4g2043 1660 Pfam Calcium-binding EGF domain 1202 1242 IPR001881 GO:0005509
Gso4g2043 1660 Pfam Calcium-binding EGF domain 289 319 IPR001881 GO:0005509
Gso4g2043 1660 ProSiteProfiles Protein kinase domain profile. 1328 1610 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 Gene3D Laminin 292 329 - -
Gso4g2043 1660 Gene3D Laminin 240 291 - -
Gso4g2043 1660 SUPERFAMILY EGF/Laminin 1201 1237 - -
Gso4g2043 1660 Pfam Protein kinase domain 416 679 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 Pfam Protein kinase domain 1330 1594 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE 2-LIKE ISOFORM X2 24 707 - -
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE 2-LIKE ISOFORM X2 914 1624 - -
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE 2-LIKE ISOFORM X2 742 858 - -
Gso4g2043 1660 ProSitePatterns Calcium-binding EGF-like domain signature. 1202 1228 IPR018097 GO:0005509
Gso4g2043 1660 ProSiteProfiles EGF-like domain profile. 1202 1240 IPR000742 -
Gso4g2043 1660 SUPERFAMILY Protein kinase-like (PK-like) 1309 1599 IPR011009 -
Gso4g2043 1660 Pfam Wall-associated receptor kinase galacturonan-binding 31 89 IPR025287 GO:0030247
Gso4g2043 1660 Pfam Wall-associated receptor kinase galacturonan-binding 938 998 IPR025287 GO:0030247
Gso4g2043 1660 SMART serkin_6 1328 1601 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 SMART serkin_6 416 689 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 ProSitePatterns Calcium-binding EGF-like domain signature. 289 314 IPR018097 GO:0005509
Gso4g2043 1660 SMART egfca_6 1202 1246 IPR001881 GO:0005509
Gso4g2043 1660 SMART egfca_6 289 334 IPR001881 GO:0005509
Gso4g2043 1660 CDD EGF_CA 1202 1237 - -
Gso4g2043 1660 CDD EGF_CA 289 317 - -
Gso4g2043 1660 CDD STKc_IRAK 1334 1601 - -
Gso4g2043 1660 CDD STKc_IRAK 422 686 - -
Gso4g2043 1660 ProSiteProfiles Protein kinase domain profile. 416 696 IPR000719 GO:0004672|GO:0005524|GO:0006468
Gso4g2043 1660 ProSitePatterns Serine/Threonine protein kinases active-site signature. 535 547 IPR008271 GO:0004672|GO:0006468
Gso4g2043 1660 ProSitePatterns Protein kinases ATP-binding region signature. 422 445 IPR017441 GO:0005524
Gso4g2043 1660 Gene3D Phosphorylase Kinase; domain 1 1303 1403 - -
Gso4g2043 1660 ProSitePatterns Serine/Threonine protein kinases active-site signature. 1449 1461 IPR008271 GO:0004672|GO:0006468
Gso4g2043 1660 SUPERFAMILY Protein kinase-like (PK-like) 402 700 IPR011009 -
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE-LIKE 21 742 858 IPR045274 GO:0007166
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE-LIKE 21 24 707 IPR045274 GO:0007166
Gso4g2043 1660 PANTHER WALL-ASSOCIATED RECEPTOR KINASE-LIKE 21 914 1624 IPR045274 GO:0007166
Gso4g2043 1660 Gene3D Transferase(Phosphotransferase) domain 1 1404 1623 - -
Gso4g2043 1660 Gene3D Transferase(Phosphotransferase) domain 1 492 705 - -
Gso4g2043 1660 Gene3D Plasmodium vivax P25 domain 1119 1250 - -
Gso4g2043 1660 SMART egf_5 1153 1201 IPR000742 -
Gso4g2043 1660 SMART egf_5 241 288 IPR000742 -
Gso4g2043 1660 SMART egf_5 292 334 IPR000742 -
Gso4g2043 1660 SMART egf_5 1205 1246 IPR000742 -
Gso4g2043 1660 SUPERFAMILY EGF/Laminin 288 325 - -
Gso4g2043 1660 ProSitePatterns Aspartic acid and asparagine hydroxylation site. 1219 1230 IPR000152 -
Gso4g2043 1660 Gene3D Phosphorylase Kinase; domain 1 381 491 - -
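
Because the annotation rows are listed in no particular order, the domain layout is easier to read once the hits are sorted by start position. The sketch below does this for the six Pfam rows only, with coordinates copied from the table above. The `Hit` type and the `domain_order` function are hypothetical names used for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    description: str
    start: int
    end: int


# Pfam rows copied from the annotation table above (Description, Start, End).
PFAM_HITS = [
    Hit("Calcium-binding EGF domain", 1202, 1242),
    Hit("Calcium-binding EGF domain", 289, 319),
    Hit("Protein kinase domain", 416, 679),
    Hit("Protein kinase domain", 1330, 1594),
    Hit("Wall-associated receptor kinase galacturonan-binding", 31, 89),
    Hit("Wall-associated receptor kinase galacturonan-binding", 938, 998),
]


def domain_order(hits):
    """Return the hit descriptions ordered by start coordinate."""
    return [h.description for h in sorted(hits, key=lambda h: h.start)]


architecture = domain_order(PFAM_HITS)

# Print the hits from N-terminus to C-terminus.
for hit in sorted(PFAM_HITS, key=lambda h: h.start):
    print(f"{hit.start:>5}-{hit.end:<5} {hit.description}")
```

Sorted this way, the same three-domain sequence (galacturonan-binding, calcium-binding EGF, protein kinase) appears twice. This is consistent with the two large PANTHER spans at 24-707 and 914-1624 in the table.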
       

Duplication type information


Gene Chromosome Start End Duplicated_type
Gso4g2043 Gso-Chr4 50811892 50824666 Dispersed/Wgd
       

Functional genes information


Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Gso4g2043 751 891 Receptor kinase-like Gene Family AT1G21230 29.940 2.86e-10 63.2
       

Transcription factors information


Regulatory Factors Family Gene Hmm_acc Hmm_name E_value Clan
PK RLK-Pelle_WAK Gso4g2043 Pkinase 1.6e-47 CL0016
       

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Gso4g2043 - - gsj:114410917 3325.03
       

Event-related genes


Gene_1 Chr_1 Start_1 End_1 Gene_2 Chr_2 Start_2 End_2 Event_name
Gso4g2043 4 50811892 50824666 Gso6g1008 6 9781614 9792584 GST
Gso14g1245 14 30278969 30291786 Gso4g2043 4 50811892 50824666 PCT
Gso13g0198 13 11059025 11063459 Gso4g2043 4 50811892 50824666 PCT
Gso4g2043 4 50811892 50824666 Gso14g1245 14 30278969 30291786 PCT
Gso4g2043 4 50811892 50824666 Gso13g0198 13 11059025 11063459 PCT
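
Some gene pairs in the table above appear twice, once in each orientation (for example Gso14g1245/Gso4g2043 and Gso4g2043/Gso14g1245). The sketch below shows one way to collapse such reciprocal rows into unordered pairs. It assumes that the order of Gene_1 and Gene_2 carries no meaning, which the source does not state. `unique_pairs` is a hypothetical helper name.

```python
# (Gene_1, Gene_2, Event_name) copied from the table above; coordinates omitted.
EVENTS = [
    ("Gso4g2043", "Gso6g1008", "GST"),
    ("Gso14g1245", "Gso4g2043", "PCT"),
    ("Gso13g0198", "Gso4g2043", "PCT"),
    ("Gso4g2043", "Gso14g1245", "PCT"),
    ("Gso4g2043", "Gso13g0198", "PCT"),
]


def unique_pairs(events):
    """Collapse reciprocal rows, keeping the first occurrence of each pair."""
    seen = set()
    result = []
    for gene_1, gene_2, event in events:
        # A frozenset ignores the order of the two genes.
        key = (frozenset((gene_1, gene_2)), event)
        if key not in seen:
            seen.add(key)
            result.append((tuple(sorted((gene_1, gene_2))), event))
    return result


pairs = unique_pairs(EVENTS)
for genes, event in pairs:
    print(event, *genes)
```

Under that assumption, the five rows reduce to three distinct pairs: one GST pair and two PCT pairs.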