Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Jan 6:2023.01.05.522936.
doi: 10.1101/2023.01.05.522936.

Leveraging family data to design Mendelian Randomization that is provably robust to population stratification

Affiliations

Leveraging family data to design Mendelian Randomization that is provably robust to population stratification

Nathan LaPierre et al. bioRxiv. .

Update in

Abstract

Mendelian Randomization (MR) has emerged as a powerful approach to leverage genetic instruments to infer causality between pairs of traits in observational studies. However, the results of such studies are susceptible to biases due to weak instruments as well as the confounding effects of population stratification and horizontal pleiotropy. Here, we show that family data can be leveraged to design MR tests that are provably robust to confounding from population stratification, assortative mating, and dynastic effects. We demonstrate in simulations that our approach, MR-Twin, is robust to confounding from population stratification and is not affected by weak instrument bias, while standard MR methods yield inflated false positive rates. We applied MR-Twin to 121 trait pairs in the UK Biobank dataset and found that MR-Twin identifies likely causal trait pairs and does not identify trait pairs that are unlikely to be causal. Our results suggest that confounding from population stratification can lead to false positives for existing MR methods, while MR-Twin is immune to this type of confounding.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:. Illustrations of Mendelian Randomization assumptions and the MR-Twin framework.
(a) Directed Acyclic Graph (DAG) depicting variables and their relationships in a typical Mendelian Randomization (MR) study, where X is the genotypic instrument, E is the exposure trait, and. O is the outcome trait. An external confounder Z, such as population stratification, can cause violations of the MR assumptions. (b) If we have the parental haplotypes A, then X is independent of Z given A. (c) Illustration of the MR-Twin workflow. Digital twin genotypes are sampled from the parental genotypes. MR-Twin is a conditional randomization test, conditioned on A and therefore immune to confounding from Z, in which the p-value is computed based on the quantile of the true offspring’s MR-Twin statistic compared to the digital twins’ statistics.
Figure 2:
Figure 2:. False Positive Rate (FPR) and Power comparison between various methods run on simulated data.
(a) False positive rate (y-axis) under varying levels of confounding due to population stratification (PS), with the x-axis describing the magnitude of the confounding effect of population labels on the exposure and outcome trait. (b) Power (y-axis) as a function of the magnitude of the causal effect of the exposure on the outcome trait (x-axis) in a setting with no confounding. Results are averaged over 1000 simulation replicates.
Figure 3:
Figure 3:. False Positive Rate (FPR) and Power comparison between various methods run on simulated trio data.
This is similar to Figure 2 except that IVW, Egger, Median, and Mode are run on the offsprings of the trio dataset instead of the large “external” group of unrelated individuals, such that all methods have the same sample size. (a) False positive rate (y-axis) under varying levels of confounding due to population stratification (PS), with the x-axis describing the magnitude of the effect of the population labels on the exposure and outcome trait. (b) Power (y-axis) as a function of the causal effect size (x-axis). Results are averaged over 1000 simulation replicates.

References

    1. Bycroft C., Freeman C., Petkova D., Band G., Elliott L. T., Sharp K., Motyer A., Vukcevic D., Delaneau O., O’Connell J., et al. , “The uk biobank resource with deep phenotyping and genomic data,” Nature, vol. 562, no. 7726, pp. 203–209, 2018. - PMC - PubMed
    1. Hemani G., Zheng J., Elsworth B., Wade K. H., Haberland V., Baird D., Laurin C., Burgess S., Bowden J., Langdon R., et al. , “The mr-base platform supports systematic causal inference across the human phenome,” elife, vol. 7, p. e34408, 2018. - PMC - PubMed
    1. Wade K. H., Carslake D., Sattar N., Davey Smith G., and Timpson N. J., “Bmi and mortality in uk biobank: revised estimates using mendelian randomization,” Obesity, vol. 26, no. 11, pp. 1796–1806, 2018. - PMC - PubMed
    1. Lyall D. M., Celis-Morales C., Ward J., Iliodromiti S., Anderson J. J., Gill J. M., Smith D. J., Ntuk U. E., Mackay D. F., Holmes M. V., et al. , “Association of body mass index with cardiometabolic disease in the uk biobank: a mendelian randomization study,” JAMA cardiology, vol. 2, no. 8, pp. 882–889, 2017. - PMC - PubMed
    1. Haycock P. C., Burgess S., Nounu A., Zheng J., Okoli G. N., Bowden J., Wade K. H., Timpson N. J., Evans D. M., Willeit P., et al. , “Association between telomere length and risk of cancer and non-neoplastic diseases: a mendelian randomization study,” JAMA oncology, vol. 3, no. 5, pp. 636–651, 2017. - PMC - PubMed

Publication types