Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 May;9(5):e1003484.
doi: 10.1371/journal.pgen.1003484. Epub 2013 May 9.

From mouse to human: evolutionary genomics analysis of human orthologs of essential genes

Affiliations

From mouse to human: evolutionary genomics analysis of human orthologs of essential genes

Benjamin Georgi et al. PLoS Genet. 2013 May.

Abstract

Understanding the core set of genes that are necessary for basic developmental functions is one of the central goals in biology. Studies in model organisms identified a significant fraction of essential genes through the analysis of null-mutations that lead to lethality. Recent large-scale next-generation sequencing efforts have provided unprecedented data on genetic variation in human. However, evolutionary and genomic characteristics of human essential genes have never been directly studied on a genome-wide scale. Here we use detailed phenotypic resources available for the mouse and deep genomics sequencing data from human populations to characterize patterns of genetic variation and mutational burden in a set of 2,472 human orthologs of known essential genes in the mouse. Consistent with the action of strong, purifying selection, these genes exhibit comparatively reduced levels of sequence variation, skew in allele frequency towards more rare, and exhibit increased conservation across the primate and rodent lineages relative to the remainder of genes in the genome. In individual genomes we observed ~12 rare mutations within essential genes predicted to be damaging. Consistent with the hypothesis that mutations in essential genes are risk factors for neurodevelopmental disease, we show that de novo variants in patients with Autism Spectrum Disorder are more likely to occur in this collection of genes. While incomplete, our set of human orthologs shows characteristics fully consistent with essential function in human and thus provides a resource to inform and facilitate interpretation of sequence data in studies of human disease.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Functional and evolutionary characteristics of essential genes.
A) Distribution of the 2472 essential genes (EG) across the genome (obtained from www.ensembl.org). B) Essential genes are significantly enriched in HGMD disease genes (P = 1.73×10−14), haploinsufficient genes (P = 1.75×10−33) and ubiquitously expressed genes (P = 9.23×10−21) when compared to NLG. C) Comparison of non-synonymous to synonymous substitution rates between human and rhesus, chimp, mouse and rat in EG, NLG and ALL. Plotted substitution rates are normalized to Z-scores relative to the genome average.
Figure 2
Figure 2. Population genetics properties of essential genes.
A) Average numbers of exonic missense variants in EG, NLG and ALL. The plotted Z-score is normalized relative to the genome average. The plotted range is truncated to visualize differences between gene sets, with a full log-transformed plot available in Figure S7. B) Differences in the allele frequency distributions in four continental populations of the 1000G data for EG, NLG and ALL. A data point above the zero line corresponds to a relative excess of variants of a given allele frequency. It can be seen that the essential genes contain significantly more rare variants than either NLG or ALL. The reported p-values are with respect to all 1000 Genome samples combined.
Figure 3
Figure 3. Analysis of individual mutational load in essential genes.
The boxes span the lower and upper quartile with the median indicated by a red bar; whiskers extend to data points less than 1.5 times the interquartile range. Values are transformed to Z-scores relative to the genome average of all protein coding genes. The P-values given are for the comparison of EG versus NLG (top) and EG versus ALL (bottom). A) Ratio of non-synonymous to synonymous exonic variants. B) Gene-length corrected average number of exonic missense variants. C) Fraction of loss-of-function variants among all exonic missense variants. D) Estimates of mutational load in essential genes in each human genome at different allele frequencies. The plots show all exonic missense variants (blue), putative damaging exonic variants (orange) and loss-of-function variants (red). Error bars depict the standard deviation.
Figure 4
Figure 4. Analysis of enrichment of essential genes among genes with de novo mutations in ASD families from four recent studies.
Gene-length and GC content adjusted odds ratios (OR) and P-values for enrichment in EG in either (A) ASD affected probands or (B) family-based controls (i.e. unaffected siblings) are shown.

Similar articles

Cited by

References

    1. Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, et al. (2012) De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485: 237–241. - PMC - PubMed
    1. Ng BG, Hackmann K, Jones MA, Eroshkin AM, He P, et al. (2012) Mutations in the glycosylphosphatidylinositol gene PIGL cause CHIME syndrome. Am J Hum Genet 90: 685–688. - PMC - PubMed
    1. Emond MJ, Louie T, Emerson J, Zhao W, Mathias RA, et al. (2012) Exome sequencing of extreme phenotypes identifies DCTN4 as a modifier of chronic Pseudomonas aeruginosa infection in cystic fibrosis. Nat Genet - PMC - PubMed
    1. Morton NE, Crow JF, Muller HJ (1956) An Estimate of the Mutational Damage in Man from Data on Consanguineous Marriages. Proc Natl Acad Sci U S A 42: 855–863. - PMC - PubMed
    1. Bittles AH, Neel JV (1994) The costs of human inbreeding and their implications for variations at the DNA level. Nat Genet 8: 117–121. - PubMed

Publication types