Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;42(7):565-9.
doi: 10.1038/ng.608. Epub 2010 Jun 20.

Common SNPs explain a large proportion of the heritability for human height

Affiliations

Common SNPs explain a large proportion of the heritability for human height

Jian Yang et al. Nat Genet. 2010 Jul.

Abstract

SNPs discovered by genome-wide association studies (GWASs) account for only a small fraction of the genetic variation of complex traits in human populations. Where is the remaining heritability? We estimated the proportion of variance for human height explained by 294,831 SNPs genotyped on 3,925 unrelated individuals using a linear model analysis, and validated the estimation method with simulations based on the observed genotype data. We show that 45% of variance can be explained by considering all SNPs simultaneously. Thus, most of the heritability is not missing but has not previously been detected because the individual effects are too small to pass stringent significance tests. We provide evidence that the remaining heritability is due to incomplete linkage disequilibrium between causal variants and genotyped SNPs, exacerbated by causal variants having lower minor allele frequency than the SNPs explored to date.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Prediction error of genetic relationship. The genetic relationship at unobserved causal loci is predicted, with error, from the relationship estimated from genotyped SNPs. The prediction error is calibrated by comparing the relationship at causal loci (mimicked by a set of random SNPs with MAF ≤ θ) to that estimated from another set of random SNPs. Values plotted on y-axis are (1−β) var(Ajk) (see Online Methods for the notations) calculated from different numbers of random SNPs (N) in both adult and adolescent datasets. The slope of each line is equal to 1.0, with R2 = 1.0. The intercept (c) is constant for a certain MAF threshold θ, and c = 6.2 × 10−6 (p-value = 2 × 10−14), 3.4 × 10−6 (p-value = 9 × 10−12), 1.8 × 10−6 (p-value = 4 × 10−10), 7.8 × 10−7 (p-value = 2 × 10−7) and 9.2 × 10−9 (p-value = 0.87, not significant) for θ = 0.1, 0.2, 0.3, 0.4 and 0.5, respectively.
Figure 2
Figure 2
Estimates of variance explained by genome-wide SNPs from adjusted estimates of genetic relationships are unbiased. Results are shown as estimates of variance explained by different proportions of SNPs randomly selected from all the SNPs in the combined set. For each group of SNPs, the variance explained by genome-wide SNPs is estimated using both raw estimates of genetic relationships and adjusted estimates of genetic relationships correcting for prediction error (assuming c = 0). Error bars denote s.e. of the estimate of variance explained by genome-wide SNPs. The log-likelihood ratio test (LRT) statistic is calculated as twice the difference in log-likelihood between the full (h2 ≠ 0) and reduced (h2 = 0) models.
Figure 3
Figure 3
All pairwise comparisons contribute to the estimate of genetic variance. Shown are the squared z-score differences between individuals ( Δyjk2) plotted against the adjusted estimates of genetic relationship ( Ajk). The blue line is the linear regression line of Δyjk2 on Ajk. The intercept and regression coefficient are estimates of twice the phenotypic variance and minus twice the genetic variances, respectively. The intercept is 1.98 (s.e. = 0.001) and the regression coefficient is −1.01 (s.e = 0.27), consistent with estimates of the phenotypic and additive genetic variance of 0.990 and 0.505, respectively, and a proportion of variance explained by all SNPs of 0.51.

Comment in

Similar articles

Cited by

References

    1. Hindorff LA, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA. 2009;106:9362–7. - PMC - PubMed
    1. Donnelly P. Progress and challenges in genome-wide association studies in humans. Nature. 2008;456:728–31. - PubMed
    1. Maher B. Personal genomes: The case of the missing heritability. Nature. 2008;456:18–21. - PubMed
    1. Manolio TA, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. - PMC - PubMed
    1. Frazer KA, Murray SS, Schork NJ, Topol EJ. Human genetic variation and its contribution to complex traits. Nat Rev Genet. 2009;10:241–51. - PubMed

Publication types