Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul;29(7):1082-1091.
doi: 10.1038/s41431-021-00836-7. Epub 2021 Mar 4.

The limits of normal approximation for adult height

Affiliations

The limits of normal approximation for adult height

Sergei A Slavskii et al. Eur J Hum Genet. 2021 Jul.

Abstract

Adult height inspired the first biometrical and quantitative genetic studies and is a test-case trait for understanding heritability. The studies of height led to formulation of the classical polygenic model, that has a profound influence on the way we view and analyse complex traits. An essential part of the classical model is an assumption of additivity of effects and normality of the distribution of the residuals. However, it may be expected that the normal approximation will become insufficient in bigger studies. Here, we demonstrate that when the height of hundreds of thousands of individuals is analysed, the model complexity needs to be increased to include non-additive interactions between sex, environment and genes. Alternatively, the use of log-normal approximation allowed us to still use the additive effects model. These findings are important for future genetic and methodologic studies that make use of adult height as an exemplar trait.

PubMed Disclaimer

Conflict of interest statement

YSA is a founder of PolyOmica and PolyKnomics, private organisations, providing services, research and development in the field of computational and statistical (gen)omics. All other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Relation between parameters of the distribution of adult human height across populations.
Linear regression of standard deviation (A) and CV (B) of height on mean height of women from ref. [20]. The dashed line shows the overall mean. (C) Linear regression of mean male height on mean female height in populations from ref. [21]. Unweighted linear regression was used to estimate the trend (k), its standard error (SE), the adjusted R2 and, in brackets, the significance of deviation of the regression coefficient from zero for A, B and from one for C (p < 0.001–***; p < 0.01–*; p > 0.05—ns) (shown at the top of each panel).
Fig. 2
Fig. 2. Changes of SD with the mean height and log-height in UK Biobank.
Relation of standard deviation to mean of height (A) and log-height (B) for six groups of British individuals of white descent from UK Biobank, defined based on place of birth and split by sex, median polygenic score, and median residual predictor (48 groups in total). The size of a symbol is proportional to the regression weight, defined as twice the group size. Weighted linear regression was used to estimate the trend (k), its standard error (SE), the adjusted R2 and, in brackets, the significance of deviation of the regression coefficient from zero (p < 0.001–***; p > 0.05—ns) (shown at the top of each panel).
Fig. 3
Fig. 3. Changes of the effects of different factors with the mean height and log-height in UK Biobank.
Relation between the estimate of the effect size of sex (A, B), genotype (C, D; genotype was defined as a polygenic height score, PGHS), other factors (E, F; a linear residual predictor, RP, combining sociodemographic and study covariates) and mean height (A, C, E) and log-height (B, D, F) for six groups of British individuals of white descent from UK Biobank, defined based on place of birth. The six groups are additionally split by sex (CF), median polygenic height score (A, B, E, F), and median residual predictor (AD). The size of a symbol is proportional to the group size (used as the regression weight). Weighted linear regression was used to estimate the trend (k), its standard error (SE), the adjusted R2 and, in brackets, the significance of deviation of the regression coefficient from zero (p < 0.001–***; p > 0.05—ns) (shown at the top of each panel).

References

    1. Galton F. Regression towards mediocrity in hereditary stature. J Anthropological Inst Gt Br Irel. 1886;15:246–63. doi: 10.2307/2841583. - DOI
    1. Pearson K, Lee A. On the Laws of Inheritance in Man: I. Inheritance of Physical Characters. Biometrika. 1903;2:357–462. doi: 10.2307/2331507. - DOI
    1. Fisher RA. XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. Earth Environ Sci Trans R Soc Edinb. 1918;52:399–433. doi: 10.1017/S0080456800012163. - DOI
    1. Maher B. The case of the missing heritability: when scientists opened up the human genome, they expected to find the genetic components of common traits and diseases. But they were nowhere to be seen BrendanMaher shines a light on six places where the missing loot could be stashed away. Nature. 2008;456:18–22. doi: 10.1038/456018a. - DOI - PubMed
    1. Visscher PM. Sizing up human height variation. Nat Genet. 2008;40:489–90. doi: 10.1038/ng0508-489. - DOI - PubMed

Publication types

LinkOut - more resources