Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 May 5;106(18):7273-80.
doi: 10.1073/pnas.0901808106. Epub 2009 Apr 7.

The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages

Affiliations

The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages

Yuri I Wolf et al. Proc Natl Acad Sci U S A. .

Abstract

The evolutionary rates of protein-coding genes in an organism span, approximately, 3 orders of magnitude and show a universal, approximately log-normal distribution in a broad variety of species from prokaryotes to mammals. This universal distribution implies a steady-state process, with identical distributions of evolutionary rates among genes that are gained and genes that are lost. A mathematical model of such process is developed under the single assumption of the constancy of the distributions of the propensities for gene loss (PGL). This model predicts that genes of different ages, that is, genes with homologs detectable at different phylogenetic depths, substantially differ in those variables that correlate with PGL. We computationally partition protein-coding genes from humans, flies, and Aspergillus fungus into age classes, and show that genes of different ages retain the universal log-normal distribution of evolutionary rates, with a shift toward higher rates in "younger" classes but also with a substantial overlap. The only exception involves human primate-specific genes that show a heavy tail of rapidly evolving genes, probably owing to gene annotation artifacts. As predicted, the gene age classes differ in characteristics correlated with PGL. Compared with "young" genes (e.g., mammal-specific human ones), "old" genes (e.g., eukaryote-specific), on average, are longer, are expressed at a higher level, possess a higher intron density, evolve slower on the short time scale, and are subject to stronger purifying selection. Thus, genome evolution fits a simple model with approximately uniform rates of gene gain and loss, without major bursts of genomic innovation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Distributions of nucleotide sequence evolution rates for pairs of closely related eukaryotic, archaeal, and bacterial genomes. The evolutionary distances were calculated using the Jukes–Cantor correction and normalized so that the mean of each distribution was equal to 1. Metma, Methanococcus maripaludis C5 vs. M. maripaludis C7 (Euryarchaeota); Bursp, Burkholderia cenocepacia MC0–3 vs. B. vietnamiensis G4 (Proteobacteria); Salsp, Salinispora arenicola CNS-205 vs. S. tropica CNB-440 (Actinobacteria). The probability density curves were obtained by Gaussian-kernel smoothing of the individual data points (64).
Fig. 2.
Fig. 2.
Partitioning of eukaryotic gene sets into age classes. The filled portion of each bar shows those genes for which likely orthologs were identified in the corresponding closely related species, and the empty portion shows the remainder of the genes for which orthologs were not detected. (Top) Homo sapiens (orthologs in Macaca mulatta). (Middle) Drosophila melanogaster (orthologs in D. simulans). (Bottom) Aspergillus fumigatus (orthologs in Neosartorya fischeri).
Fig. 3.
Fig. 3.
Distributions of nucleotide sequence evolution rates for different age classes of eukaryotic genes. The evolutionary distances were calculated as in Fig. 1 but not normalized. (Top) H. sapiens (vs. M. mulatta). (Middle) D. melanogaster (vs. D. simulans). (Bottom) A. fumigatus (vs. N. fischeri). The probability density curves were obtained by Gaussian-kernel smoothing of the individual data points (64).
Fig. 4.
Fig. 4.
Expression of human genes from different age classes. (Top) The median EST counts. (Middle) The median expression level in microarray experiments. (Bottom) The median expression breadth (number of tissues where the gene is expressed) in microarray experiments.

References

    1. Doolittle WF. Lateral genomics. Trends Cell Biol. 1999;9:M5–M8. - PubMed
    1. Ochman H, Lawrence JG, Groisman EA. Lateral gene transfer and the nature of bacterial innovation. Nature. 2000;405:299–304. - PubMed
    1. Koonin EV, Wolf YI. Genomics of Bacteria and Archaea: The emerging dynamic view of the prokaryotic world. Nucleic Acids Res. 2008;36:6688–6719. - PMC - PubMed
    1. Embley TM, Martin W. Eukaryotic evolution, changes and challenges. Nature. 2006;440:623–630. - PubMed
    1. Esser C, et al. A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes. Mol Biol Evol. 2004;21:1643–1660. - PubMed

Publication types