Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Dec 7;91(6):1022-32.
doi: 10.1016/j.ajhg.2012.10.015.

Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing

Affiliations

Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing

Yali Xue et al. Am J Hum Genet. .

Abstract

We have assessed the numbers of potentially deleterious variants in the genomes of apparently healthy humans by using (1) low-coverage whole-genome sequence data from 179 individuals in the 1000 Genomes Pilot Project and (2) current predictions and databases of deleterious variants. Each individual carried 281-515 missense substitutions, 40-85 of which were homozygous, predicted to be highly damaging. They also carried 40-110 variants classified by the Human Gene Mutation Database (HGMD) as disease-causing mutations (DMs), 3-24 variants in the homozygous state, and many polymorphisms putatively associated with disease. Whereas many of these DMs are likely to represent disease-allele-annotation errors, between 0 and 8 DMs (0-1 homozygous) per individual are predicted to be highly damaging, and some of them provide information of medical relevance. These analyses emphasize the need for improved annotation of disease alleles both in mutation databases and in the primary literature; some HGMD mutation data have been recategorized on the basis of the present findings, an iterative process that is both necessary and ongoing. Our estimates of deleterious-allele numbers are likely to be subject to both overcounting and undercounting. However, our current best mean estimates of ~400 damaging variants and ~2 bona fide disease mutations per individual are likely to increase rather than decrease as sequencing studies ascertain rare variants more effectively and as additional disease alleles are discovered.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Condel-Score Distribution for HGMD-Only DMs, Variants Only in the 1000 Genomes Low-Coverage Pilot, and Overlap Variants Shown are Condel scores for HGMD-only DMs (blue), variants only in the 1000 Genomes Low-Coverage Pilot data (yellow), and overlap variants (green). Condel scores range from 0 (the amino acid change is predicted to not damage the protein) to 1 (the amino acid change is predicted to damage the protein). The top decile of Condel scores is further subdivided in the panel on the right-hand side.
Figure 2
Figure 2
Representation of HGMD Variant Classes in the 1000 Genomes Low-Coverage Pilot (A) HGMD variants subdivided by functional category. The following abbreviations are used: DM, disease-causing mutation; FPs, in vitro or in vivo functional polymorphism; DFP, disease-associated polymorphism with additional supporting functional evidence; and DP, disease-associated polymorphism. (B) HGMD variants subdivided by variant type. The first four types are all SNPs.
Figure 3
Figure 3
Derived Allele-Frequency Spectra of Nonsynonymous and Synonymous Variants and HGMD SNPs in Three 1000 Genomes Low-Coverage Pilot Samples The following abbreviations are used: NSV, nonsynonymous variant; SV, synonymous variant; DM, disease-causing mutation; FP, in vitro or in vivo functional polymorphism; DFP, disease-associated polymorphism with additional supporting functional evidence; and DP, disease-associated polymorphism.

References

    1. Stenson P.D., Mort M., Ball E.V., Howells K., Phillips A.D., Thomas N.S., Cooper D.N. The Human Gene Mutation Database: 2008 update. Genome Med. 2009;1:13. - PMC - PubMed
    1. Adzhubei I.A., Schmidt S., Peshkin L., Ramensky V.E., Gerasimova A., Bork P., Kondrashov A.S., Sunyaev S.R. A method and server for predicting damaging missense mutations. Nat. Methods. 2010;7:248–249. - PMC - PubMed
    1. Ng P.C., Henikoff S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31:3812–3814. - PMC - PubMed
    1. Bell C.J., Dinwiddie D.L., Miller N.A., Hateley S.L., Ganusova E.E., Mudge J., Langley R.J., Zhang L., Lee C.C., Schilkey F.D. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci. Transl. Med. 2011;3:ra4. - PMC - PubMed
    1. Yngvadottir B., MacArthur D.G., Jin H., Tyler-Smith C. The promise and reality of personal genomics. Genome Biol. 2009;10:237. - PMC - PubMed

Publication types

LinkOut - more resources