Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul;33(7):1032-1041.
doi: 10.1101/gr.277664.123. Epub 2023 May 17.

Leveraging family data to design Mendelian randomization that is provably robust to population stratification

Affiliations

Leveraging family data to design Mendelian randomization that is provably robust to population stratification

Nathan LaPierre et al. Genome Res. 2023 Jul.

Abstract

Mendelian randomization (MR) has emerged as a powerful approach to leverage genetic instruments to infer causality between pairs of traits in observational studies. However, the results of such studies are susceptible to biases owing to weak instruments, as well as the confounding effects of population stratification and horizontal pleiotropy. Here, we show that family data can be leveraged to design MR tests that are provably robust to confounding from population stratification, assortative mating, and dynastic effects. We show in simulations that our approach, MR-Twin, is robust to confounding from population stratification and is not affected by weak instrument bias, whereas standard MR methods yield inflated false positive rates. We then conduct an exploratory analysis of MR-Twin and other MR methods applied to 121 trait pairs in the UK Biobank data set. Our results suggest that confounding from population stratification can lead to false positives for existing MR methods, whereas MR-Twin is immune to this type of confounding, and that MR-Twin can help assess whether traditional approaches may be inflated owing to confounding from population stratification.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Illustrations of Mendelian randomization (MR) assumptions and the MR-Twin framework. (A) Directed acyclic graph (DAG) depicting variables and their relationships in a typical MR study, where X is the genotypic instrument, E is the exposure trait, and O is the outcome trait. An external confounder Z, such as population stratification, can cause violations of the MR assumptions. (B) If we have the parental haplotypes A, then X is independent of Z given A. (C) Illustration of the MR-Twin workflow. Digital twin genotypes are sampled from the parental genotypes. MR-Twin is a conditional randomization test, conditioned on A and therefore immune to confounding from Z, in which the P-value is computed based on the quantile of the true offspring's MR-Twin statistic compared with the digital twins’ statistics.
Figure 2.
Figure 2.
False-positive rate (FPR) and power comparison between various methods run on simulated data. (A) FPR (y-axis) under varying levels of confounding owing to population stratification (PS), with the x-axis describing the magnitude of the confounding effect of population labels on the exposure and outcome trait. (B) Power (y-axis) as a function of the magnitude of the causal effect of the exposure on the outcome trait (x-axis) in a setting with no confounding. Results are averaged over 1000 simulation replicates.
Figure 3.
Figure 3.
FPR and power comparison between various methods run on simulated trio data. This is similar to Figure 2 except that IVW, Egger, Median, and Mode are run on the offspring of the trio data set instead of the large “external” group of unrelated individuals, such that all methods have the same sample size. (A) FPR (y-axis) under varying levels of confounding owing to population stratification (PS), with the x-axis describing the magnitude of the effect of the population labels on the exposure and outcome trait. (B) Power (y-axis) as a function of the causal effect size (x-axis). Results are averaged over 1000 simulation replicates.

Update of

References

    1. Abecasis GR, Cardon LR, Cookson W. 2000. A general test of association for quantitative traits in nuclear families. Am J Hum Genet 66: 279–292. 10.1086/302698 - DOI - PMC - PubMed
    1. Balding DJ, Nichols RA. 1995. A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica 96: 3–12. 10.1007/BF01441146 - DOI - PubMed
    1. Bates S, Sesia M, Sabatti C, Candès E. 2020. Causal inference in genetic trio studies. Proc Natl Acad Sci 117: 24117–24126. 10.1073/pnas.2007743117 - DOI - PMC - PubMed
    1. Benner C, Havulinna AS, Järvelin MR, Salomaa V, Ripatti S, Pirinen M. 2017. Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am J Hum Genet 101: 539–551. 10.1016/j.ajhg.2017.08.012 - DOI - PMC - PubMed
    1. Berg JJ, Harpak A, Sinnott-Armstrong N, Joergensen AM, Mostafavi H, Field Y, Boyle EA, Zhang X, Racimo F, Pritchard JK, et al. 2019. Reduced signal for polygenic adaptation of height in UK Biobank. eLife 8: e39725. 10.7554/eLife.39725 - DOI - PMC - PubMed

Publication types