Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2011 Apr 8;88(4):458-68.
doi: 10.1016/j.ajhg.2011.03.008. Epub 2011 Mar 31.

A genome-wide comparison of the functional properties of rare and common genetic variants in humans

Affiliations
Comparative Study

A genome-wide comparison of the functional properties of rare and common genetic variants in humans

Qianqian Zhu et al. Am J Hum Genet. .

Abstract

One of the longest running debates in evolutionary biology concerns the kind of genetic variation that is primarily responsible for phenotypic variation in species. Here, we address this question for humans specifically from the perspective of population allele frequency of variants across the complete genome, including both coding and noncoding regions. We establish simple criteria to assess the likelihood that variants are functional based on their genomic locations and then use whole-genome sequence data from 29 subjects of European origin to assess the relationship between the functional properties of variants and their population allele frequencies. We find that for all criteria used to assess the likelihood that a variant is functional, the rarer variants are significantly more likely to be functional than the more common variants. Strikingly, these patterns disappear when we focus on only those variants in which the major alleles are derived. These analyses indicate that the majority of the genetic variation in terms of phenotypic consequence may result from a mutation-selection balance, as opposed to balancing selection, and have direct relevance to the study of human disease.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The Distribution of SNVs in MAF Bins The MAF range of each bin is shown in Table 1.
Figure 2
Figure 2
The Cumulative Distribution Plot of Conservation Scores Corresponding to SNVs in Each MAF Bin Insets shows the conservation score distribution of SNVs in the first (red line) and last bin (blue line). Only the distribution of conservation scores for primates is shown. Consistent results were obtained using conservation scores for placental mammals or vertebrates (Figures S1 and S2).
Figure 3
Figure 3
The Enrichment of SNVs from Each MAF Bin in Functional Regions The enrichment of SNVs from each MAF bin in units of gene structure (A) and regulatory regions (B). The relative fraction is calculated as the fraction of SNVs from each bin falling inside a particular type of functional region divided by the fraction corresponding to SNVs from the first bin. The result corresponding to protein-coding genes is very close to the result corresponding to genes, and therefore is not shown in the figure. The relationship between MAF and SNV enrichment in 5′UTR is not significant and therefore is not shown either.
Figure 4
Figure 4
The Enrichment of SNVs in Functional Regions when Minor Alleles Are Ancestral or Derived The enrichment of SNVs in which minor alleles are ancestral (A) or derived (B) in functional regions. The relative fraction is calculated as the fraction of SNVs from each bin falling inside a particular type of functional region divided by the fraction corresponding to SNVs from the first bin. The relationship between MAF and SNV enrichment in extremely conserved noncoding elements is not shown in (A) because only 189 SNVs are in this group, and no significant pattern is observed.
Figure 5
Figure 5
Simulation Results Corresponding to Varying Proportions of Purifying and Positive Selection ΔOR measures the enrichment of selected SNVs in rare SNVs versus common SNVs between two situations: minor allele is ancestral and minor allele is derived (see Material and Methods). The x axis is the ratio between the proportion of purifying selection and positive selection. The red line corresponds to the value observed from nonsynonymous SNVs in our real data when assuming all nonsynonymous SNVs are under selection.

Similar articles

Cited by

  • Challenges imposed by minor reference alleles on the identification and reporting of clinical variants from exome data.
    Koko M, Abdallah MOE, Amin M, Ibrahim M. Koko M, et al. BMC Genomics. 2018 Jan 15;19(1):46. doi: 10.1186/s12864-018-4433-3. BMC Genomics. 2018. PMID: 29334895 Free PMC article.
  • Evidence of inbreeding depression on human height.
    McQuillan R, Eklund N, Pirastu N, Kuningas M, McEvoy BP, Esko T, Corre T, Davies G, Kaakinen M, Lyytikäinen LP, Kristiansson K, Havulinna AS, Gögele M, Vitart V, Tenesa A, Aulchenko Y, Hayward C, Johansson A, Boban M, Ulivi S, Robino A, Boraska V, Igl W, Wild SH, Zgaga L, Amin N, Theodoratou E, Polašek O, Girotto G, Lopez LM, Sala C, Lahti J, Laatikainen T, Prokopenko I, Kals M, Viikari J, Yang J, Pouta A, Estrada K, Hofman A, Freimer N, Martin NG, Kähönen M, Milani L, Heliövaara M, Vartiainen E, Räikkönen K, Masciullo C, Starr JM, Hicks AA, Esposito L, Kolčić I, Farrington SM, Oostra B, Zemunik T, Campbell H, Kirin M, Pehlic M, Faletra F, Porteous D, Pistis G, Widén E, Salomaa V, Koskinen S, Fischer K, Lehtimäki T, Heath A, McCarthy MI, Rivadeneira F, Montgomery GW, Tiemeier H, Hartikainen AL, Madden PA, d'Adamo P, Hastie ND, Gyllensten U, Wright AF, van Duijn CM, Dunlop M, Rudan I, Gasparini P, Pramstaller PP, Deary IJ, Toniolo D, Eriksson JG, Jula A, Raitakari OT, Metspalu A, Perola M, Järvelin MR, Uitterlinden A, Visscher PM, Wilson JF; ROHgen Consortium. McQuillan R, et al. PLoS Genet. 2012;8(7):e1002655. doi: 10.1371/journal.pgen.1002655. Epub 2012 Jul 19. PLoS Genet. 2012. PMID: 22829771 Free PMC article.
  • Genome-wide rare variant analysis for thousands of phenotypes in over 70,000 exomes from two cohorts.
    Cirulli ET, White S, Read RW, Elhanan G, Metcalf WJ, Tanudjaja F, Fath DM, Sandoval E, Isaksson M, Schlauch KA, Grzymski JJ, Lu JT, Washington NL. Cirulli ET, et al. Nat Commun. 2020 Jan 28;11(1):542. doi: 10.1038/s41467-020-14288-y. Nat Commun. 2020. PMID: 31992710 Free PMC article.
  • Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals.
    Battle A, Mostafavi S, Zhu X, Potash JB, Weissman MM, McCormick C, Haudenschild CD, Beckman KB, Shi J, Mei R, Urban AE, Montgomery SB, Levinson DF, Koller D. Battle A, et al. Genome Res. 2014 Jan;24(1):14-24. doi: 10.1101/gr.155192.113. Epub 2013 Oct 3. Genome Res. 2014. PMID: 24092820 Free PMC article.
  • Biochemical characterization of eight genetic variants of human DNA polymerase κ involved in error-free bypass across bulky N(2)-guanyl DNA adducts.
    Song I, Kim EJ, Kim IH, Park EM, Lee KE, Shin JH, Guengerich FP, Choi JY. Song I, et al. Chem Res Toxicol. 2014 May 19;27(5):919-30. doi: 10.1021/tx500072m. Epub 2014 Apr 21. Chem Res Toxicol. 2014. PMID: 24725253 Free PMC article.

References

    1. Manolio T.A., Collins F.S., Cox N.J., Goldstein D.B., Hindorff L.A., Hunter D.J., McCarthy M.I., Ramos E.M., Cardon L.R., Chakravarti A. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. - PMC - PubMed
    1. Cirulli E.T., Goldstein D.B. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat. Rev. Genet. 2010;11:415–425. - PubMed
    1. McClellan J., King M.C. Genetic heterogeneity in human disease. Cell. 2010;141:210–217. - PubMed
    1. Alexander R.P., Fang G., Rozowsky J., Snyder M., Gerstein M.B. Annotating non-coding regions of the genome. Nat. Rev. Genet. 2010;11:559–571. - PubMed
    1. Birney E., Stamatoyannopoulos J.A., Dutta A., Guigó R., Gingeras T.R., Margulies E.H., Weng Z., Snyder M., Dermitzakis E.T., Thurman R.E., ENCODE Project Consortium. NISC Comparative Sequencing Program. Baylor College of Medicine Human Genome Sequencing Center. Washington University Genome Sequencing Center. Broad Institute. Children's Hospital Oakland Research Institute Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007;447:799–816. - PMC - PubMed

Publication types

LinkOut - more resources