Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 May 5;109(5):767-782.
doi: 10.1016/j.ajhg.2022.04.001. Epub 2022 Apr 21.

Combining evidence from Mendelian randomization and colocalization: Review and comparison of approaches

Affiliations
Review

Combining evidence from Mendelian randomization and colocalization: Review and comparison of approaches

Verena Zuber et al. Am J Hum Genet. .

Abstract

Mendelian randomization and colocalization are two statistical approaches that can be applied to summarized data from genome-wide association studies (GWASs) to understand relationships between traits and diseases. However, despite similarities in scope, they are different in their objectives, implementation, and interpretation, in part because they were developed to serve different scientific communities. Mendelian randomization assesses whether genetic predictors of an exposure are associated with the outcome and interprets an association as evidence that the exposure has a causal effect on the outcome, whereas colocalization assesses whether two traits are affected by the same or distinct causal variants. When considering genetic variants in a single genetic region, both approaches can be performed. While a positive colocalization finding typically implies a non-zero Mendelian randomization estimate, the reverse is not generally true: there are several scenarios which would lead to a non-zero Mendelian randomization estimate but lack evidence for colocalization. These include the existence of distinct but correlated causal variants for the exposure and outcome, which would violate the Mendelian randomization assumptions, and a lack of strong associations with the outcome. As colocalization was developed in the GWAS tradition, typically evidence for colocalization is concluded only when there is strong evidence for associations with both traits. In contrast, a non-zero estimate from Mendelian randomization can be obtained despite only nominally significant genetic associations with the outcome at the locus. In this review, we discuss how the two approaches can provide complementary information on potential therapeutic targets.

Keywords: Causal inference; Genetic epidemiology; phenome-wide association study; post-GWAS investigations; shared heritability.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests D.G. is a part-time employee of Novo Nordisk. I.M. and C.W. are wholly or partially funded by a grant from GSK and MSD. The other authors have no relevant conflict of interest to declare.

Figures

Figure 1
Figure 1
Schematic diagram illustrating analogy between Mendelian randomization and randomized trial Adapted from Hingorani and Humphries.
Figure 2
Figure 2
Schematic diagrams illustrating colocalization in five scenarios (A) Two traits with distinct causal variants in linkage disequilibrium. (B) Two unrelated traits with a shared causal variant. (C) Two traits with a shared causal variant where the first trait influences the second trait. (D and E) One shared causal variant and one distinct causal variant for trait 2. Scenarios (B) and (C) are examples of colocalization. For scenarios (D) and (E), there is colocalization at the shared variant, but not at the distinct variant. Colocalization is unable to distinguish between the scenarios in which trait 1 and trait 2 are causally unrelated (scenarios B and D), and in which trait 1 has a causal effect on trait 2 (scenarios C and E). Illustrative regional association plots for each scenario represent the negative log10 p values for associations of variants with each trait (blue for trait 1, red for trait 2) plotted against chromosomal position.
Figure 3
Figure 3
Scatter plots of genetic associations with LDL cholesterol, coronary heart disease, and Alzheimer disease Genetic associations with LDL cholesterol (horizontal axis, standard deviation units) against genetic associations with (A) coronary heart disease and (B) Alzheimer disease (vertical axis, odds ratios) for 75 genetic variants associated with LDL cholesterol. Error bars represent 95% confidence intervals for the genetic associations; dashed line represents inverse-variance weighted estimate (dotted lines represent 95% confidence intervals for this estimate). In the right-hand plot, variants in the APOE gene region are marked with triangles.
Figure 4
Figure 4
Regional association plots for the PCSK9 gene region Genetic associations (negative log10 p values) plotted against chromosome position (megabases, Mb) for variants around the PCSK9 gene region with LDL cholesterol, coronary heart disease risk, and Alzheimer disease risk. Note the well-defined peak around the lead variant for both LDL cholesterol and coronary heart disease (marked in red), and the absence of a well-defined peak around any lead variant for Alzheimer disease. Colocalization suggests that LDL cholesterol and coronary heart disease have a shared causal variant, which is this lead variant, and no evidence that there is a causal variant for Alzheimer disease at this locus. Figures were made using the karyoploteR package: http://bioconductor.org/packages/release/bioc/html/karyoploteR.html.
Figure 5
Figure 5
Regional association plots for the APOE gene region Genetic associations (negative log10 p values) plotted against chromosome position (megabases, Mb) for variants around the APOE gene region with LDL cholesterol and Alzheimer disease risk. Note the well-defined peak around the lead variant for both traits (marked in green for LDL cholesterol, and blue for Alzheimer disease). However, in this case, colocalization suggests the peaks have distinct causal variants.

References

    1. Freedman M.L., Monteiro A.N.A., Gayther S.A., Coetzee G.A., Risch A., Plass C., Casey G., De Biasi M., Carlson C., Duggan D., et al. Principles for the post-GWAS functional characterization of cancer risk loci. Nat. Genet. 2011;43:513–518. doi: 10.1038/ng.840. - DOI - PMC - PubMed
    1. Gallagher M.D., Chen-Plotkin A.S. The post-GWAS era: from association to function. Am. J. Hum. Genet. 2018;102:717–730. doi: 10.1016/j.ajhg.2018.04.002. - DOI - PMC - PubMed
    1. Sheiner L.B., Rubin D.B. Intention-to-treat analysis and the goals of clinical trials. Clin. Pharmacol. Ther. 1995;57:6–15. doi: 10.1016/0009-9236(95)90260-0. - DOI - PubMed
    1. Hingorani A., Humphries S. Nature’s randomised trials. The Lancet. 2005;366:1906–1908. doi: 10.1016/s0140-6736(05)67767-7. - DOI - PubMed
    1. Thanassoulis G., O'Donnell C.J. Mendelian randomization: nature’s randomized trial in the post-genome era. J. Am. Med. Assoc. 2009;301:2386–2388. doi: 10.1001/jama.2009.812. - DOI - PMC - PubMed

Publication types