Gene search


Sequence information


Gene Cds Cds_length GC_content Pep Pep_length
Gso1g0651 ATGGGAGTCCACGGTCTCTGGGAACTCCTCGCCCCCGTCGGTCGCCGCGTCTCCGTCGAAACCCTTGCCGGAAAAACCCTAGCCGTTGATGCGAGCATATGGATGGTACAGTTTGTCAAAGCGATGCGCGACGAGAAGGGCGAAATGGTTCGCAACGCCCATTTGCTGGGCTTCTTCCGTCGCATTTGCAAACTTCTTTTCCTCCGCACCAAGCCGGTCTTCGTCTTCGACGGCGGGACCCCTGCCCTCAAGCGCCGCACTGTCATTGCACGCCGCCGCCAGCGTGAAAACGCCCAAGCCAAAGTCCGCAAAACCGCCGAGAAATTGCTGCTCAATCATTTAAAGGCATTAAGGTTGAAGGAACTGGCTGATGATCTTAAGAACCAAAGGATGAAGAAGAATAGTGATACTAAGGGTCAGAAAAAATCTAATCAGAAGGACTTTGTTGGAAGTGATTTAGGAGGAAGTCATGTGAAAGAGCTAGATGAAATGTCTGTGGCTAAATATGCAGCTAAGGAGGACGGAAATTCTTCGCAGGCAACAATATTGACTACTTACAACCAGGAGGAACTTCATGAAATGTTGGCAGCGTCTATAGCTGCAGAGAAAAATGGCATACATGCCAGAAAAGGAATGCCGTCTATTGTAATTAATCCTTTAGAGGAGGAACGTGATGCAGACGAACAAATTATATTGCCATCAGTAAATGCAGAAGTTGATATGGCTGTATTAGCTGCCTTACCACAATCAATGCAACTTGATATTCTTGCACAGCTTAAAGGAAAGAAAACCGAAGGACTAGTAAAGGAAGTTGACAATCAGAATCAACATGATGTCAATTATCGGGGCAAGGGTAAGGGGATTCTGCTCATTGAAGCTGACATGGTAGGTTGTAGCTCCAGACATGACAATGTTACATCAAGGAGTGACAATCAACACTCAATTGATGAAATGTTAGCTGCATCTATTGCTATGGAGGAAAATGAAGAGTTAGTAAATAATACATCAACTTCTGTTGGGGCTTCCGCTATTGAGGAAGAGGAAGTTGACTATGATGAAGATGAAGAAATGATACTGCCAGCTATGCATGGTAAAATTGATCCTGCTGTTCTAGCCTCATTGCCTCCATCAATGCAACTGGACCTTCTTGTTCAGATGAGAGAGCGTTTGATTGCAGAGAACAGACAAAAGTATCAGAAAGTCAAAAAGGATCCAGCAAAATTCTCTGAGCTACAGATACAAGCTTACCTTAAAACTGTCGCTTTCAGACGGGACATAGATGAAGTGCAGAAAGCTGCAGCTGTAGGAGGAGTAGGTGGTGTACAGACTTCACGGATTGCATCTGAAGCCAACAGAGAATATATTTTCTCGTCTTCTTTTACTGGTGATAAACAAGAACTTACCTCCACCAGCTTAGAGAAAAATAAAGATACACAACAGAAGGTCCAGGGAGTACATCCTTCACAGAACCTTACAGATAGCATTGTGGCAGGAAATGATTCTAATACTTCAAGCGGATTGGTTCACAATGAACCTGGTGAGCCTGCTGATGAAAGTATTCAGACATATCTTGATGAGAGGGGTCGGTTTCGAGTTAGTAGATTGAGAGCTATGGGGATGCGTATGACCTGTGATATACAACGGAATTTGGATTTGTTGAAGGAGATTGAGCAGGAAAGAGCATATGTGAACAAGGCTGCGAATATTGGAACAGTGGAAAATGCTGAGAATAATGGTCCATATGAAAGTTCTGGGATCCAGCTTGTTGGTAAATCACAAGAGATGAATGTTGACTTAGTTGGACAGAATATGCAAAATGAACAAACAATGCTTGACAGAGATACATTGATAGAGATATCTTTTGAATATGATTGCAAGAATAAGTTCGCAAATGATGAGGATGATATATTTTCTAGTTTAGTAGGAGGAAATCCAGTGGCAATCTTTGGTGCTGATGATACTGCAGCAACCGAACAACCTTCTCATTCTG
ATTCAGATTGTGATTGGGAAGAAGGAATTCTCGAAGGTAAGAGTAATGCTTATCCTGAACATGACGTGGTAGAATTGAAGTCTTCTGTTGCAGATGATCATAAAAATAATGAAAGAGAAGTAGAATGGGAGGAAGGAGATTGTGATGGTGCTAACAGCACCTTACTGTCAGGAAAATTGGCATCTCAAGGGTGGTTGGAGGAGGAGTCTGATTTGCAAGAGGCAATAAGGAGAAGTCTTGAAAGTATAGGGGATATGAAACTTAAATGCATGCCAGCTGTAGATGAGCATTCAAATACTTATGAGAACAAATTGGATTGTGGTTTAGAACATGGTGATGATCTGTATTATTCTGACCCTGTGGATTTAAATGACAACGTTGGGTTTCTGAATAATAAAAATAGGGAAGACAGTACTGAAAAAAATGAATTACATGAAATTGAAGATGGAGATAAAAAGCATGATTTTGTTTCTGGCAATAATGAACAAACTTTCCATTTTCATGGAAGTCAGTCAAAGTCATCTGTGACTTTTAATTCCAATAACACTGAGATATTGATCGACACACCTTGCAGAATGGACAGTCACTCTTGTTTTGTAGATTCAATTTCAGATACAAATGTAATGACGAAGGACCTAGTCCCTATGGTTGCTGAGCAATTATTGGATAAACATGATGATGGTAAAGTGTCTTTCTATTGTGACAATACATCCAAGGTTGATCCAGTTGGTGCAACTGAAGAGGGGAAAAAAAACTATATTCAAGAATCTGAACCATTGAGTAATTCTACTGACACTACCAAACCTGCTATTTTGGTAGAGTCATCCTTGAAAGGATCAACAGAAGACCTTGATATTGAGCCAAAATTGCCTTCAGAGGACAGTAATAGAAATTTCTACGAGGAAAGGAATAGTAGCCTTGGCAATGATGTGGTTAACACCCCGGGACATTTTCCTGCTCATGCAGCTGAGGTTAGCTTGGAGGAAGAAATGCAAATTCTTGGTCAAGAATATATCAACCTAGAAAATGAGCAGAGGAAGCTAGAGCGGAATGCAGAATCTGTAAACAGTGAATTATTCACTGAATGTCAGGAACTACTGCAAATGTTTGGCTTGCCATATATTATTGCTCCAATGGAAGCAGAAGCTCAATGTGCTTATTTGGAACTTGAGAAACTAGTTGATGGTGTTGTGACTGATGACTCTGATGTCCTTTTATTTGGGGCACGCAGTGTTTACAAAAATATATTTGATGACCGCAAATATGTAGAGACATACTTCATGGAGGATATTGAAAAGGAGCTTGGATTGACCAGAGAAAAATTAATACGCATGGCTCTACTTCTTGGGAGTGATTATACTGAAGGTGTAAGTGGGATTGGCATTGTTAATGCTATTGAGGTTGTGAATGCATTCCCTGAGGAAGATGGCCTCCTGAAATTCCGGCAATGGGTTGAGTCACCGGATCCCACCATCCTTGGAAGGTTGGATGCAAATAGTGGTTCAAATTCCAGAAAGAAAGGGTCAAAAATTGAAGAAAAGATGAATTCCTCAAGTTGCAATGTTAAAGAGTCTGCGGTGATGCAAAACATCTGCCATGCTCAGGAGCAAAATGAGTTGTCAGATTACATCCAAGAAATAAAACAAACTTTCTTCAATAAGCATAGAAATGTTAGCAAGAATTGGCACATTCCTTCTTCTTTTCCAAGTGATACTGTTATATCTGCTTACTATTCTCCCCATGTTGATAAATCCACTGAGCCATTCACATGGGGAAAGCCAGATCATCTTGTTCTTCGAAAATTGTGCTGGGAGAAATTTGGGTGGACTGGCCAGAAAGCAGATGAATTGATCCTACCAGTCTTAAAGGAGTATAACAAGCGTGAGACTCAATTGCGGTTGGAAGCATTTTACAATTTTAATGAAAGATTTGCAAAAATTCGTAGTAAAAGAATTAAGAAAGCGGTAAAAGGAATCACTGGTAAGCAGCCTTCAGATTTG
ATAGATGATTCTGCAGAAGAGTTCTCCAAGAGTAGGAAGACTGGGAGAGAACCTGAGGATATCACATTGGAGACTTCAAGGGGAATAGAGGGAAATCTTGAGGGTAGAAGGAAATCAAAAATAAAACAGTCAAGGAAGAATGATACTGTTGCTAAGGAACAGTCAAAGAAAAAGAAAGTCAATGATGATCCCTCTTCAGCACCTGGTACATCTGAGATTGAGAATTTACAGCCAAGTCTGCAGATAGAAGAAGAGCAACATGATGGTAAGGCATTGATTCGGAATAGAAGTGGCAGAGGAAGAGGTAGAATTATGGGAATAAAAAGAGGAAGGGATAACAAAGGTCTCAGTTTTCAATCTTGCGAAACTGAAGCCTCATCTGGTAGCAGTGACATTGATGATCATGGGCCAAGAGTGCATGTGGATAGAGTTCCAAAAGATGTGCGAAGGTCAATGCGATCTCGGAAACCTGTCAACTATTCTTTCAAAGAGCCTGAAGATGAAGATTCTGATGATTCATTTGATCGGAGAAATCAGACTGGTCCAATAGAAGAAAATTTATCTCATATTCTTGGTGCTTGTGAAGATGGTGCAACAGATTTCAGCATGGCGAAAGAATGCAGTGCAATGAATTTTCCTCCAGAGGAGAACTTGCCTACAGACTCCCTTGAGTCAGGTGGTTGGTTTTGCACAGATGCCGGTGAAACTTGTCATCCTGGTACTGGCAATCAGGACTCTTCTGATGACTACCTTAAAATGGGAGGTGGATTCTGTTTAGATGATGGTGATACAGGTGTCAAGCAGGATACAAGTGACAATGTCGATACTGCTACAGTCGATTATAATGCAGACTTTCCACACGGTTCTGATTATTTGGATGAAACTAATCGTGATAAAAGTAGTTCAGATATATTATTTTCTGGCGCTGAAAAGCCTGAAAATGGGATACAAGGTGGAGGGCCATTCAATATAGAGCCAAATGACCTTGCAAGTGCTAGTAGTTATGATCATTCTGATATAGCGGTCTTGAAACAGGAGAATACTCGCAACAATAGTGGAGCCTCCACTGGAGCATTTAGTGCCATGCCGTTTTTGAAGAGAAGAAGGAAAAACTGA 5106 0.4105 
MGVHGLWELLAPVGRRVSVETLAGKTLAVDASIWMVQFVKAMRDEKGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGGTPALKRRTVIARRRQRENAQAKVRKTAEKLLLNHLKALRLKELADDLKNQRMKKNSDTKGQKKSNQKDFVGSDLGGSHVKELDEMSVAKYAAKEDGNSSQATILTTYNQEELHEMLAASIAAEKNGIHARKGMPSIVINPLEEERDADEQIILPSVNAEVDMAVLAALPQSMQLDILAQLKGKKTEGLVKEVDNQNQHDVNYRGKGKGILLIEADMVGCSSRHDNVTSRSDNQHSIDEMLAASIAMEENEELVNNTSTSVGASAIEEEEVDYDEDEEMILPAMHGKIDPAVLASLPPSMQLDLLVQMRERLIAENRQKYQKVKKDPAKFSELQIQAYLKTVAFRRDIDEVQKAAAVGGVGGVQTSRIASEANREYIFSSSFTGDKQELTSTSLEKNKDTQQKVQGVHPSQNLTDSIVAGNDSNTSSGLVHNEPGEPADESIQTYLDERGRFRVSRLRAMGMRMTCDIQRNLDLLKEIEQERAYVNKAANIGTVENAENNGPYESSGIQLVGKSQEMNVDLVGQNMQNEQTMLDRDTLIEISFEYDCKNKFANDEDDIFSSLVGGNPVAIFGADDTAATEQPSHSDSDCDWEEGILEGKSNAYPEHDVVELKSSVADDHKNNEREVEWEEGDCDGANSTLLSGKLASQGWLEEESDLQEAIRRSLESIGDMKLKCMPAVDEHSNTYENKLDCGLEHGDDLYYSDPVDLNDNVGFLNNKNREDSTEKNELHEIEDGDKKHDFVSGNNEQTFHFHGSQSKSSVTFNSNNTEILIDTPCRMDSHSCFVDSISDTNVMTKDLVPMVAEQLLDKHDDGKVSFYCDNTSKVDPVGATEEGKKNYIQESEPLSNSTDTTKPAILVESSLKGSTEDLDIEPKLPSEDSNRNFYEERNSSLGNDVVNTPGHFPAHAAEVSLEEEMQILGQEYINLENEQRKLERNAESVNSELFTECQELLQMFGLPYIIAPMEAEAQCAYLELEKLVDGVVTDDSDVLLFGARSVYKNIFDDRKYVETYFMEDIEKELGLTREKLIRMALLLGSDYTEGVSGIGIVNAIEVVNAFPEEDGLLKFRQWVESPDPTILGRLDANSGSNSRKKGSKIEEKMNSSSCNVKESAVMQNICHAQEQNELSDYIQEIKQTFFNKHRNVSKNWHIPSSFPSDTVISAYYSPHVDKSTEPFTWGKPDHLVLRKLCWEKFGWTGQKADELILPVLKEYNKRETQLRLEAFYNFNERFAKIRSKRIKKAVKGITGKQPSDLIDDSAEEFSKSRKTGREPEDITLETSRGIEGNLEGRRKSKIKQSRKNDTVAKEQSKKKKVNDDPSSAPGTSEIENLQPSLQIEEEQHDGKALIRNRSGRGRGRIMGIKRGRDNKGLSFQSCETEASSGSSDIDDHGPRVHVDRVPKDVRRSMRSRKPVNYSFKEPEDEDSDDSFDRRNQTGPIEENLSHILGACEDGATDFSMAKECSAMNFPPEENLPTDSLESGGWFCTDAGETCHPGTGNQDSSDDYLKMGGGFCLDDGDTGVKQDTSDNVDTATVDYNADFPHGSDYLDETNRDKSSSDILFSGAEKPENGIQGGGPFNIEPNDLASASSYDHSDIAVLKQENTRNNSGASTGAFSAMPFLKRRRKN* 1702
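The summary columns in the row above can be cross-checked against the sequence itself: 5106 nucleotides / 3 = 1702 codons, which matches Pep_length when the trailing `*` stop symbol is counted. The following is a minimal sketch of such a check, not the database's own method. The function name `cds_summary` and the returned keys are hypothetical, and the short input string is a toy sequence rather than the full Gso1g0651 CDS.

```python
def cds_summary(cds: str) -> dict:
    """Summarise a coding sequence: length, GC fraction, codon count.

    Hypothetical helper for illustration; whitespace and line breaks
    inside the sequence are ignored so a wrapped CDS can be pasted as-is.
    """
    seq = "".join(cds.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return {
        "cds_length": len(seq),
        "gc_content": round(gc / len(seq), 4),
        "codons": len(seq) // 3,          # includes the stop codon, if present
        "in_frame": len(seq) % 3 == 0,    # a complete CDS should be a multiple of 3
    }


# Toy input (assumption: not the real gene), wrapped over two lines.
toy = cds_summary("ATGGGAGTC\nCACGGTTGA")
print(toy)
```

Applied to the full CDS, `cds_length` and `gc_content` would be expected to agree with the Cds_length and GC_content columns, and `codons` with Pep_length.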
       

Annotation information


Seq ID Length Analysis Description Start End IPR GO
Gso1g0651 1701 MobiDBLite consensus disorder prediction 472 509 - -
Gso1g0651 1701 ProSitePatterns XPG protein signature 2. 1034 1048 IPR019974 GO:0016788
Gso1g0651 1701 ProSitePatterns XPG protein signature 1. 70 84 IPR019974 GO:0016788
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1676 1701 - -
Gso1g0651 1701 CDD H3TH_XPG 1103 1247 - -
Gso1g0651 1701 SMART xpgineu 1031 1100 IPR006086 GO:0004518
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1612 1661 - -
Gso1g0651 1701 Gene3D - 1 166 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 128 155 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 128 145 - -
Gso1g0651 1701 PRINTS Xeroderma pigmentosum group G/yeast RAD superfamily signature 24 38 IPR006084 -
Gso1g0651 1701 PRINTS Xeroderma pigmentosum group G/yeast RAD superfamily signature 72 91 IPR006084 -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1158 1178 - -
Gso1g0651 1701 Pfam XPG I-region 1034 1115 IPR006086 GO:0004518
Gso1g0651 1701 Pfam Ubiquitin binding region 233 258 IPR025527 -
Gso1g0651 1701 Pfam Ubiquitin binding region 366 390 IPR025527 -
Gso1g0651 1701 Coils Coil 987 1021 - -
Gso1g0651 1701 Pfam XPG N-terminal domain 1 97 IPR006085 GO:0004518
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1330 1368 - -
Gso1g0651 1701 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED 259 1700 - -
Gso1g0651 1701 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED 1 262 - -
Gso1g0651 1701 SMART HhH_4 1102 1135 IPR008918 GO:0003677|GO:0003824
Gso1g0651 1701 CDD PIN_XPG_RAD2 1017 1101 - -
Gso1g0651 1701 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS 1 262 - -
Gso1g0651 1701 PANTHER DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS 259 1700 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1321 1517 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 472 517 - -
Gso1g0651 1701 Gene3D - 1100 1154 - -
Gso1g0651 1701 SUPERFAMILY 5' to 3' exonuclease, C-terminal subdomain 1100 1309 IPR036279 -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1374 1393 - -
Gso1g0651 1701 SUPERFAMILY PIN domain-like 2 1115 IPR029060 -
Gso1g0651 1701 SMART xpgn3 1 98 IPR006085 GO:0004518
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1458 1509 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1411 1425 - -
Gso1g0651 1701 Gene3D - 982 1099 - -
Gso1g0651 1701 PRINTS Xeroderma pigmentosum group G protein signature 2 19 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Gso1g0651 1701 PRINTS Xeroderma pigmentosum group G protein signature 54 77 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Gso1g0651 1701 PRINTS Xeroderma pigmentosum group G protein signature 96 118 IPR001044 GO:0003697|GO:0004519|GO:0005634|GO:0006289
Gso1g0651 1701 CDD PIN_XPG_RAD2 2 92 - -
Gso1g0651 1701 MobiDBLite consensus disorder prediction 1394 1410 - -
Gso1g0651 1701 Gene3D - 227 265 - -
       

Duplication type information


Gene Chromosome Start End Duplicated_type
Gso1g0651 Gso-Chr1 18452891 18464556 Dispersed/Transposed
       

Functional genes information


Gene Gene_start Gene_end Function Ath_gene Identity(%) E-value Score
Gso1g0651 1025 1132 Core DNA Replication Machinery Family AT5G26680 36.937 7.24e-13 70.9
       

Pathway information


Query KO Definition Second KO KEGG Genes ID GHOSTX Score
Gso1g0651 K10846 - gmx:100820295 3346.21