Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Dec;25(12):2699-707.
doi: 10.1093/molbev/msn214. Epub 2008 Sep 26.

An ancient evolutionary origin of genes associated with human genetic diseases

Affiliations

An ancient evolutionary origin of genes associated with human genetic diseases

Tomislav Domazet-Loso et al. Mol Biol Evol. 2008 Dec.

Abstract

Several thousand genes in the human genome have been linked to a heritable genetic disease. The majority of these appear to be nonessential genes (i.e., are not embryonically lethal when inactivated), and one could therefore speculate that they are late additions in the evolutionary lineage toward humans. Contrary to this expectation, we find that they are in fact significantly overrepresented among the genes that have emerged during the early evolution of the metazoa. Using a phylostratigraphic approach, we have studied the evolutionary emergence of such genes at 19 phylogenetic levels. The majority of disease genes was already present in the eukaryotic ancestor, and the second largest number has arisen around the time of evolution of multicellularity. Conversely, genes specific to the mammalian lineage are highly underrepresented. Hence, genes involved in genetic diseases are not simply a random subset of all genes in the genome but are biased toward ancient genes.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.—
FIG. 1.—
Phylogenetic framework used in the search for the human gene origins. Taxa represented in the databases with complete genomes or a substantial amount of TRACE and EST data are in bold. Taxa in italics are represented in the databases only with small numbers of highly conserved genes, and their exclusion from the analysis does not influence the results.
F<sc>IG</sc>. 2.—
FIG. 2.—
Phylostratigraphy of all human genes and different classes of disease-causing genes. The total number of human genes (N = 22,845) found in the different phylostrata is plotted (blue line—squares, note logarithmic scale on the y axis). Distribution of the total number of evaluated disease genes (N = 1,760, red line—circles). The subsample of nonessential disease genes (N = 1,305), the stringent nonessential subsample (N = 1,020), and the genes involved in polygenic traits (N = 149) are also shown. The correlation coefficient between gene count and ranked evolutionary time is listed on top (estimated by Spearman's rank correlation coefficient).
F<sc>IG</sc>. 3.—
FIG. 3.—
Probabilities of over- or underrepresentation of disease-causing genes in the respective phylostrata. Log-odds ratios show how the frequency of disease genes in each phylostratum deviates from the expected one estimated from the whole set of genes. Numbering of the phylostrata corresponds to those in figures 1 and 2 (*P < 0.05; **P < 0.01; ***P < 0.001, two-tailed hypergeometric test corrected for multiple comparison by FDR at 0.05 level).

References

    1. Alba MM, Castresana J. On homology searches by protein Blast and the characterization of the age of genes. BMC Evol Biol. 2007;7:53. - PMC - PubMed
    1. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57:289–300.
    1. Bourlat SJ, Juliusdottir T, Lowe CJ, et al. (11 co-authors) Deuterostome phylogeny reveals monophyletic chordates and the new phylum Xenoturbellida. Nature. 2006;444:85–88. - PubMed
    1. Daubin V, Ochman H. Bacterial genomes as new gene homes: the genealogy of ORFans in E. coli. Genome Res. 2004;14(6):1036–1042. - PMC - PubMed
    1. de Duve C. The origin of eukaryotes: a reappraisal. Nat Rev Genet. 2007;8:395–403. - PubMed

Publication types