Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Sep;19(9):1553-61.
doi: 10.1101/gr.092619.109. Epub 2009 Jul 14.

Identification of deleterious mutations within three human genomes

Affiliations

Identification of deleterious mutations within three human genomes

Sung Chun et al. Genome Res. 2009 Sep.

Abstract

Each human carries a large number of deleterious mutations. Together, these mutations make a significant contribution to human disease. Identification of deleterious mutations within individual genome sequences could substantially impact an individual's health through personalized prevention and treatment of disease. Yet, distinguishing deleterious mutations from the massive number of nonfunctional variants that occur within a single genome is a considerable challenge. Using a comparative genomics data set of 32 vertebrate species we show that a likelihood ratio test (LRT) can accurately identify a subset of deleterious mutations that disrupt highly conserved amino acids within protein-coding sequences, which are likely to be unconditionally deleterious. The LRT is also able to identify known human disease alleles and performs as well as two commonly used heuristic methods, SIFT and PolyPhen. Application of the LRT to three human genomes reveals 796-837 deleterious mutations per individual, approximately 40% of which are estimated to be at <5% allele frequency. However, the overlap between predictions made by the LRT, SIFT, and PolyPhen, is low; 76% of predictions are unique to one of the three methods, and only 5% of predictions are shared across all three methods. Our results indicate that only a small subset of deleterious mutations can be reliably identified, but that this subset provides the raw material for personalized medicine.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Venn diagram of deleterious mutations identified in J. Craig Venter, James D. Watson, and a Han Chinese individual. The percentage of individual-specific deleterious mutations found in each genome is shown in parentheses.
Figure 2.
Figure 2.
Characteristics of deleterious mutations. (A) Deleterious mutations (n = 1928) are more likely to occur in recently duplicated genes relative to neutral variants (n = 8287). (B) Mutations at perfectly conserved sites, mutations that cause radical amino acid changes, defined by BLOSUM62 ≤ −2, and mutations to amino acids that are not observed outside of eutherian mammals are more frequent among rare (n = 807) compared with common deleterious mutations (n = 1121).
Figure 3.
Figure 3.
Comparison of SIFT, PolyPhen, and the likelihood ratio test (LRT) predictions. (A) Venn diagram of the number of predictions made by the three methods. Probably damaging mutations were used for PolyPhen. Numbers below and above each line are for the complete set of 7534 high-quality variants present within the Venter genome and a subset of 4303 where all three methods generated a prediction, respectively. (B) Overlap between the LRT and SIFT predictions based on the same alignments.

References

    1. Ahituv N, Kavaslar N, Schackwitz W, Ustaszewska A, Martin J, Hebert S, Doelle H, Ersoy B, Kryukov G, Schmidt S, et al. Medical sequencing at the extremes of human body mass. Am J Hum Genet. 2007;80:779–791. - PMC - PubMed
    1. Asthana S, Roytberg M, Stamatoyannopoulos J, Sunyaev S. Analysis of sequence conservation at nucleotide resolution. PLoS Comput Biol. 2007;3:e254. doi: 10.1371/journal.pcbi.0030254. - DOI - PMC - PubMed
    1. Boyko AR, Williamson SH, Indap AR, Degenhardt JD, Hernandez RD, Lohmueller KE, Adams MD, Schmidt S, Sninsky JJ, Sunyaev SR, et al. Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet. 2008;4:e1000083. doi: 10.1371/journal.pgen.1000083. - DOI - PMC - PubMed
    1. Brockman W, Alvarez P, Young S, Garber M, Giannoukos G, Lee WL, Russ C, Lander ES, Nusbaum C, Jaffe DB. Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res. 2008;18:763–770. - PMC - PubMed
    1. Bromberg Y, Rost B. Snap: Predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res. 2007;35:3823–3835. - PMC - PubMed

Publication types

LinkOut - more resources