Outlier Detection in Mendelian Randomization
- PMID: 40658369
- PMCID: PMC12258383
- DOI: 10.1002/sim.70143
Outlier Detection in Mendelian Randomization
Abstract
Mendelian randomization (MR) uses genetic variants as instrumental variables to infer causal effects of exposures on an outcome. One key assumption of MR is that the genetic variants used as instrumental variables are independent of the outcome conditional on the risk factor and unobserved confounders. Violations of this assumption, that is, the effect of the instrumental variables on the outcome through a path other than the risk factor included in the model (which can be caused by pleiotropy), are common phenomena in human genetics. Genetic variants, which deviate from this assumption, appear as outliers to the MR model fit and can be detected by the general heterogeneity statistics proposed in the literature, which are known to suffer from overdispersion, that is, too many genetic variants are declared as false outliers. We propose a method that corrects for overdispersion of the heterogeneity statistics in uni- and multivariable MR analysis by making use of the estimated inflation factor to correctly remove outlying instruments and therefore account for pleiotropic effects. Our method is applicable to summary-level data.
Keywords: Mendelian randomization; instrumental variables; invalid instruments; outlier detection; pleiotropy.
© 2025 The Author(s). Statistics in Medicine published by John Wiley & Sons Ltd.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures



References
-
- Davey Smith G. and Ebrahim S., “Mendelian Randomization: Can Genetic Epidemiology Contribute to Understanding Environmental Determinants of Disease?,” International Journal of Epidemiology 32, no. 1 (2003): 1–22. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources