Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Oct 29;39(11):1539-1552.
doi: 10.1093/jbmr/zjae136.

Causal inference in health and disease: a review of the principles and applications of Mendelian randomization

Affiliations
Review

Causal inference in health and disease: a review of the principles and applications of Mendelian randomization

Catherine E Lovegrove et al. J Bone Miner Res. .

Abstract

Mendelian randomization (MR) is a genetic epidemiological technique that uses genetic variation to infer causal relationships between modifiable exposures and outcome variables. Conventional observational epidemiological studies are subject to bias from a range of sources; MR analyses can offer an advantage in that they are less prone to bias as they use genetic variants inherited at conception as "instrumental variables", which are proxies of an exposure. However, as with all research tools, MR studies must be carefully designed to yield valuable insights into causal relationships between exposures and outcomes, and to avoid biased or misleading results that undermine the validity of the causal inferences drawn from the study. In this review, we outline Mendel's laws of inheritance, the assumptions and principles that underlie MR, MR study designs and methods, and how MR analyses can be applied and reported. Using the example of serum phosphate concentrations on liability to kidney stone disease we illustrate how MR estimates may be visualized and, finally, we contextualize MR in bone and mineral research including exemplifying how this technique could be employed to inform clinical studies and future guidelines concerning BMD and fracture risk. This review provides a framework to enhance understanding of how MR may be used to triangulate evidence and progress research in bone and mineral metabolism as we strive to infer causal effects in health and disease.

Keywords: epidemiology; genetic association studies; human association studies; mendelian randomization; therapeutics.

Plain language summary

Mendelian randomization is a powerful tool used by researchers to understand how factors like lifestyle choices or medical conditions affect our health. Some research studies can be hampered by inaccurate information or measurements which mean the results are unreliable. Mendelian randomization uses genetic information to evade this problem. Because a person’s genetic information does not change the studies can provide more reliable conclusions about how different factors influence our health. This review provides an overview of Mendelian randomization principles, its applications, and how researchers in bone and mineral research have used it to uncover cause-and-effect relationships.

PubMed Disclaimer

Conflict of interest statement

M.V.H. is an employee of 23andMe and holds stock in 23andMe.

Figures

Figure 1
Figure 1
Pitfalls in observational studies. Confounding variable affecting exposure and outcome: measured confounders can be accounted for in multivariable analyses; unmeasured confounders can cause unreliable results. Residual confounding: confounding that persists despite adjusting for measured confounders. Reverse causality: possible effect of outcome on exposure, often neglected. Regression dilution bias: random errors in measurement causing bias in estimated association with outcome.
Figure 2
Figure 2
Analogy of an RCT and MR. MR leverages naturally occurring genetic variations as “instrumental variables” to mimic the random assignment of exposures, like random allocation in an RCT. This genetic randomization helps address confounding factors and strengthens causal inference when investigating the relationships between exposures, outcomes, and potential interventions.
Figure 3
Figure 3
Conditional assumptions in MR. (A) Relevance: The genetic variant(s) acting as a proxy for the exposure variable are strongly associated with the exposure. (B) Independence: The IV is independent of confounding factors, thus genetic association with confounding variables is random, akin to randomization in an RCT. (C) Exclusion restriction: The IV affects the outcome solely through the exposure variable and not by external, horizontally-pleiotropic, pathways.
Figure 4
Figure 4
Mediation MR. Mediation analysis estimates the effect of an exposure on an outcome via mediating factors (indirect mediating effect) and via mechanisms independent of the mediating factor (direct casual effect). The proportion mediated effect can subsequently be derived.
Figure 5
Figure 5
Visual representations of MR analyses of the effect of albumin-adjusted serum calcium concentrations on risk of kidney stone disease- reanalysis of data from Lovegrove et al. MR, Mendelian randomization; SNP, single nucleotide polymorphism. (A) Scatter plot of genetic associations and causal estimates. (B) Forest plot of variant-specific causal estimates, IVW, and MR-Egger estimates. (C) Funnel plot of variant-specific causal estimates. The estimate effect is plotted against the precision (reciprocal of standard error) of the estimate for each genetic variant. The vertical lines correspond to the estimates for IVW and MR-Egger estimates. (D) Forest plot of leave one out analyses. For each variant, the effect of removing it from the IV is plotted. (E) Radial plot. The x-axis represents the weigh attributed to each genetic variant and the y-axis represents its Z-statistic (point estimate divided by standard error). The greater the vertical distance between a genetic variant and the overall causal estimate, the greater its contribution to Cochran’s Q statistic of heterogeneity.
Figure 6
Figure 6
Assessing assumptions in MR. (A) Assumption in MR: MR assumes that there is an IV comprised of genetic variant(s) which are a proxy of the effect of an exposure variable in an outcome measure. (B) The F-statistic can evaluate the strength of association between the IV and exposure variable. (C) Robust MR methods, for example MR-Egger regression, can identify when genetic variants have pleiotropic effects and produce a reliable effect estimate so long as the magnitude of pleiotropic effects is not correlated with the variant-exposure effects. (D) Steiger filtering selects genetic variants that explain more variance in the exposure variable than the outcome and excludes those that fail to meet this condition to mitigate against mis-specification of the primary exposure.

References

    1. Vandenbroucke JP, von Elm E, Altman DG, et al. Strengthening the reporting of observational studies in epidemiology (STROBE): explanation and elaboration. Int J Surg. 2014;12(12):1500–1524. 10.1016/j.ijsu.2014.07.014 - DOI - PubMed
    1. Fewell Z, Davey Smith G, Sterne JAC. The impact of residual and unmeasured confounding in epidemiologic studies: a simulation study. Am J Epidemiol. 2007;166(6):646–655. 10.1093/aje/kwm165 - DOI - PubMed
    1. Smith GD, Ebrahim S. Data dredging, bias, or confounding: they can all get you into the BMJ and the Friday papers. BMJ. 2002;325(7378):1437–1438. 10.1136/bmj.325.7378.1437 - DOI - PMC - PubMed
    1. Zabor EC, Kaizer AM, Hobbs BP. Randomized controlled trials. Chest. 2020;158(1):S79–S87. 10.1016/j.chest.2020.03.013 - DOI - PMC - PubMed
    1. Hariton E, Locascio JJ. Randomised controlled trials—the gold standard for effectiveness research. BJOG Int J Obstet Gynaecol. 2018;125(13):1716. 10.1111/1471-0528.15199 - DOI - PMC - PubMed