Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 5;96(2):329-39.
doi: 10.1016/j.ajhg.2014.12.021. Epub 2015 Jan 29.

Adjusting for heritable covariates can bias effect estimates in genome-wide association studies

Affiliations

Adjusting for heritable covariates can bias effect estimates in genome-wide association studies

Hugues Aschard et al. Am J Hum Genet. .

Abstract

In recent years, a number of large-scale genome-wide association studies have been published for human traits adjusted for other correlated traits with a genetic basis. In most studies, the motivation for such an adjustment is to discover genetic variants associated with the primary outcome independently of the correlated trait. In this report, we contend that this objective is fulfilled when the tested variants have no effect on the covariate or when the correlation between the covariate and the outcome is fully explained by a direct effect of the covariate on the outcome. For all other scenarios, an unintended bias is introduced with respect to the primary outcome as a result of the adjustment, and this bias might lead to false positives. Here, we illustrate this point by providing examples from published genome-wide association studies, including large meta-analysis of waist-to-hip ratio and waist circumference adjusted for body mass index (BMI), where genetic effects might be biased as a result of adjustment for body mass index. Using both theory and simulations, we explore this phenomenon in detail and discuss the ramifications for future genome-wide association studies of correlated traits and diseases.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Underlying Causal Diagrams Four causal diagrams describing the causal relationship between the genotypes G, environment E, a covariate C, and the outcome of interest Y are shown. In (A), the correlation between Y and C is due to a direct effect of C on Y, whereas in (B)–(D,) the correlation between Y and C is explained by shared risk factors.
Figure 2
Figure 2
Effect Estimates and False Discovery Rate Results for simulations of correlated outcomes and covariates and a genetic variant that influences the covariate only are shown. In (A), the average observed bias of the genetic effect estimate in the covariate adjusted analysis is plotted as a function of the correlation between the outcome and the covariate for different values of direct genetic effect on the covariate. The dashed lines correspond to the theoretical bias as derived in the method section. In (B), the average false discovery rate (α = 0.05) of over 5,000 replicates is plotted as a function of ρY,C the correlation between the outcome and the covariate for different values of direct genetic effect on the covariate when simulating 2,000 individuals.
Figure 3
Figure 3
Heritability of Adjusted Phenotypes We compared the heritability of a given phenotype against the heritability estimated after adjustment for a correlated variable. We simulated a trait Y adjusted for a correlated trait C. The genetic variance of each trait (upper panel) splits into trait-specific effects, shared effects, and shared loci with opposite effects. We vary heritability of Y and C from 0.8 (A), 0.5 (B), and 0.2 (C) and the proportion of shared environmental variance (bottom panel) from 0 to 1.

Comment in

  • A Robust Example of Collider Bias in a Genetic Association Study.
    Day FR, Loh PR, Scott RA, Ong KK, Perry JR. Day FR, et al. Am J Hum Genet. 2016 Feb 4;98(2):392-3. doi: 10.1016/j.ajhg.2015.12.019. Am J Hum Genet. 2016. PMID: 26849114 Free PMC article. No abstract available.
  • Response to Day et al.
    Aschard H, Vilhjálmsson BJ, Joshi AD, Price AL, Kraft P. Aschard H, et al. Am J Hum Genet. 2016 Feb 4;98(2):394-5. doi: 10.1016/j.ajhg.2015.12.020. Am J Hum Genet. 2016. PMID: 26849115 Free PMC article. No abstract available.

References

    1. Price A.L., Patterson N.J., Plenge R.M., Weinblatt M.E., Shadick N.A., Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006;38:904–909. - PubMed
    1. Pickrell J.K., Marioni J.C., Pai A.A., Degner J.F., Engelhardt B.E., Nkadori E., Veyrieras J.B., Stephens M., Gilad Y., Pritchard J.K. Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010;464:768–772. - PMC - PubMed
    1. Mefford J., Witte J.S. The Covariate’s Dilemma. PLoS Genet. 2012;8:e1003096. - PMC - PubMed
    1. Pirinen M., Donnelly P., Spencer C. Efficient computation with a linear mixed model on large-scale data sets with applications to genetic studies. Ann. Appl. Stat. 2013;7:369–390.
    1. Kaplan R.C., Petersen A.K., Chen M.H., Teumer A., Glazer N.L., Döring A., Lam C.S., Friedrich N., Newman A., Müller M. A genome-wide association study identifies novel loci associated with circulating IGF-I and IGFBP-3. Hum. Mol. Genet. 2011;20:1241–1251. - PMC - PubMed

Publication types