Unique genes in plants: specificities and conserved features throughout evolution
- PMID: 18847470
- PMCID: PMC2576244
- DOI: 10.1186/1471-2148-8-280
Unique genes in plants: specificities and conserved features throughout evolution
Abstract
Background: Plant genomes contain a high proportion of duplicated genes as a result of numerous whole, segmental and local duplications. These duplications lead up to the formation of gene families, which are the usual material for many evolutionary studies. However, all characterized genomes include single-copy (unique) genes that have not received much attention. Unlike gene duplication, gene loss is not an unspecific mechanism but is rather influenced by a functional selection. In this context, we have established and used stringent criteria in order to identify suitable sets of unique genes present in plant proteomes. Comparisons of unique genes in the green phylum were used to characterize the gene and protein features exhibited by both conserved and species-specific unique genes.
Results: We identified the unique genes within both A. thaliana and O. sativa genomes and classified them according to the number of homologs in the alternative species: none (U{1:0}), one (U{1:1}) or several (U{1:m}). Regardless of the species, all the genes in these groups present some conserved characteristics, such as small average protein size and abnormal intron number. In order to understand the origin and function of unique genes, we further characterized the U{1:1} gene pairs. The possible involvement of sequence convergence in the creation of U{1:1} pairs was discarded due to the frequent conservation of intron positions. Furthermore, an orthology relationship between the two members of each U{1:1} pair was strongly supported by a high conservation in the protein sizes and transcription levels. Within the promoter of the unique conserved genes, we found a number of TATA and TELO boxes that specifically differed from their mean number in the whole genome. Many unique genes have been conserved as unique through evolution from the green alga Ostreococcus lucimarinus to higher plants. Plant unique genes may also have homologs in bacteria and we showed a link between the targeting towards plastids of proteins encoded by plant nuclear unique genes and their homology with a bacterial protein.
Conclusion: Many of the A. thaliana and O. sativa unique genes are conserved in plants for which the ancestor diverged at least 725 million years ago (MYA). Half of these genes are also present in other eukaryotic and/or prokaryotic species. Thus, our results indicate that (i) a strong negative selection pressure has conserved a number of genes as unique in genomes throughout evolution, (ii) most unique genes are subjected to a low divergence rate, (iii) they have some features observed in housekeeping genes but for most of them there is no functional annotation and (iv) they may have an ancient origin involving a possible gene transfer from ancestral chloroplasts or bacteria to the plant nucleus.
Figures






Similar articles
-
Patterns of intron loss and gain in plants: intron loss-dominated evolution and genome-wide comparison of O. sativa and A. thaliana.Mol Biol Evol. 2007 Jan;24(1):171-81. doi: 10.1093/molbev/msl159. Epub 2006 Oct 25. Mol Biol Evol. 2007. PMID: 17065597
-
Genes of cyanobacterial origin in plant nuclear genomes point to a heterocyst-forming plastid ancestor.Mol Biol Evol. 2008 Apr;25(4):748-61. doi: 10.1093/molbev/msn022. Epub 2008 Jan 24. Mol Biol Evol. 2008. PMID: 18222943
-
Extensive divergence in alternative splicing patterns after gene and genome duplication during the evolutionary history of Arabidopsis.Mol Biol Evol. 2010 Jul;27(7):1686-97. doi: 10.1093/molbev/msq054. Epub 2010 Feb 25. Mol Biol Evol. 2010. PMID: 20185454
-
Insights into the structural and functional evolution of plant genomes afforded by the nucleotide sequences of chromosomes 2 and 4 of Arabidopsis thaliana.Yeast. 2000 Apr;17(1):1-5. doi: 10.1002/(SICI)1097-0061(200004)17:1<1::AID-YEA3>3.0.CO;2-V. Yeast. 2000. PMID: 10797596 Free PMC article. Review.
-
Genome duplication and gene-family evolution: the case of three OXPHOS gene families.Gene. 2008 Sep 15;421(1-2):1-6. doi: 10.1016/j.gene.2008.05.011. Epub 2008 Jun 23. Gene. 2008. PMID: 18573316 Review.
Cited by
-
Welcome to the big leaves: Best practices for improving genome annotation in non-model plant genomes.Appl Plant Sci. 2023 Aug 8;11(4):e11533. doi: 10.1002/aps3.11533. eCollection 2023 Jul-Aug. Appl Plant Sci. 2023. PMID: 37601314 Free PMC article.
-
TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation.BMC Genomics. 2010 Mar 12;11:166. doi: 10.1186/1471-2164-11-166. BMC Genomics. 2010. PMID: 20222994 Free PMC article.
-
Transcriptome and Metabolome Analyses of the Salt Stress Response Mechanism in Lonicera caerulea.Biology (Basel). 2025 May 31;14(6):641. doi: 10.3390/biology14060641. Biology (Basel). 2025. PMID: 40563892 Free PMC article.
-
Genome-wide analysis reveals diverged patterns of codon bias, gene expression, and rates of sequence evolution in picea gene families.Genome Biol Evol. 2015 Mar 5;7(4):1002-15. doi: 10.1093/gbe/evv044. Genome Biol Evol. 2015. PMID: 25747252 Free PMC article.
-
Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants.Proc Natl Acad Sci U S A. 2013 Feb 19;110(8):2898-903. doi: 10.1073/pnas.1300127110. Epub 2013 Feb 4. Proc Natl Acad Sci U S A. 2013. PMID: 23382190 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources