. 2020 Jun 26;11(1):3255.

doi: 10.1038/s41467-020-16969-0.

Genetic drug target validation using Mendelian randomisation

Amand F Schmidt^#^{1

2

3}, Chris Finan^#^{4

5}, Maria Gordillo-Marañón⁴, Folkert W Asselbergs^{4

6

7}, Daniel F Freitag⁸, Riyaz S Patel^{4

5}, Benoît Tyl⁹, Sandesh Chopade^{4

5}, Rupert Faraway^{4

5

10}, Magdalena Zwierzyna^{4

5}, Aroon D Hingorani^{4

5

7}

Affiliations

¹ Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, WC1E 6BT, UK. amand.schmidt@ucl.ac.uk.
² UCL BHF Research Accelerator Centre, London, UK. amand.schmidt@ucl.ac.uk.
³ Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands. amand.schmidt@ucl.ac.uk.
⁴ Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, WC1E 6BT, UK.
⁵ UCL BHF Research Accelerator Centre, London, UK.
⁶ Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands.
⁷ Health Data Research UK, 222 Euston Road, London, UK.
⁸ Bayer AG Pharmaceuticals, Open Innovation & Digital Technologies, Aprather Weg 18a, Wuppertal, 42096, Germany.
⁹ Center for Therapeutic Innovation, Cardiovascular and Metabolic Disease, Institut de Recherches Internationales Servier, 50 Rue Carnot, 92284, Suresnes Cedex, France.
¹⁰ The Francis Crick Institute, 1 Midland Rd, London, NW1 1AT, UK.

^# Contributed equally.

PMID: 32591531
PMCID: PMC7320010
DOI: 10.1038/s41467-020-16969-0

Genetic drug target validation using Mendelian randomisation

Amand F Schmidt et al. Nat Commun. 2020.

. 2020 Jun 26;11(1):3255.

doi: 10.1038/s41467-020-16969-0.

Authors

Affiliations

¹ Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, WC1E 6BT, UK. amand.schmidt@ucl.ac.uk.
² UCL BHF Research Accelerator Centre, London, UK. amand.schmidt@ucl.ac.uk.
³ Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands. amand.schmidt@ucl.ac.uk.
⁴ Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, WC1E 6BT, UK.
⁵ UCL BHF Research Accelerator Centre, London, UK.
⁶ Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands.
⁷ Health Data Research UK, 222 Euston Road, London, UK.
⁸ Bayer AG Pharmaceuticals, Open Innovation & Digital Technologies, Aprather Weg 18a, Wuppertal, 42096, Germany.
⁹ Center for Therapeutic Innovation, Cardiovascular and Metabolic Disease, Institut de Recherches Internationales Servier, 50 Rue Carnot, 92284, Suresnes Cedex, France.
¹⁰ The Francis Crick Institute, 1 Midland Rd, London, NW1 1AT, UK.

^# Contributed equally.

PMID: 32591531
PMCID: PMC7320010
DOI: 10.1038/s41467-020-16969-0

Abstract

Mendelian randomisation (MR) analysis is an important tool to elucidate the causal relevance of environmental and biological risk factors for disease. However, causal inference is undermined if genetic variants used to instrument a risk factor also influence alternative disease-pathways (horizontal pleiotropy). Here we report how the 'no horizontal pleiotropy assumption' is strengthened when proteins are the risk factors of interest. Proteins are typically the proximal effectors of biological processes encoded in the genome. Moreover, proteins are the targets of most medicines, so MR studies of drug targets are becoming a fundamental tool in drug development. To enable such studies, we introduce a mathematical framework that contrasts MR analysis of proteins with that of risk factors located more distally in the causal chain from gene to disease. We illustrate key model decisions and introduce an analytical framework for maximising power and evaluating the robustness of analyses.

PubMed Disclaimer

Conflict of interest statement

DFF is a full-time employee of Bayer AG, Germany. BT is a full-time employee of Servier. RSP has received honoraria from Sanofi, Bayer and Amgen. MZ is a full-time employee of GSK. AFS and FWA have received Servier funding for unrelated work. MZ conducted this research as an employee of BenevolentAI. Since completing the work MZ is now a full-time employee of GlaxoSmithKline. None of the remaining authors have a competing interest to declare.

Figures

**Fig. 1. Directed acyclic graphs of potential Mendelian randomisation pathways.**
Nodes are presented in bold face, with G representing a genetic variant, P a protein drug target, X a biomarker, D the outcome, and U (potentially unmeasured) common causes of both P, X, D. Labelled paths represent the (causal) effects between nodes.

**Fig. 2. Instrument selection related variation in the point estimates of drug target Mendelian randomisation studies on the lipid’s association with CHD.**
Each estimate is based on randomly (500 iterations) selecting 4 SNPs out of 17 *HMGCR*, 30 *PCSK9*, 21 *NPC1L1*, 36 *CETP* candidate variants. Lipids data were used from the GLGC and linked to coronary heart disease data from CardiogramPlusC4D. estimates were grouped by the inclusion of instruments with worsted predicted functional or regulatory consequence; categories occurring less than five times were removed. Any pairwise LD was accounted for using the 1000 genomes ‘EUR’ reference panel and a generalised least squares method. The boxplots depict quartiles 1, 2 (median), and 3 as a box, with the whiskers presented as vertical bars and values ±1.5 times the interquartile range as dots.

**Fig. 3. Mendelian randomisation estimates of the lipids weighted associations with CHD under increasingly liberal LD-clumping thresholds.**
Lipids data were used from the GLGC, and linked to coronary heart disease data from CardiogramPlusC4D. Pairwise LD remaining after LD-clumping was accounted for using the 1000 genomes ‘EUR’ reference panel and a generalised least squares method. Estimates for *PCSK9, HMGCR*, and *NPC1L1* are given per SD in LDL-C, *CETP* estimates per HDL-C reflecting the likely effectiveness pathway to CHD. The number of included variants is depicted above the x-axis. Estimates are given as OR with 95%CI (vertical error bars).

**Fig. 4. Mendelian randomisation estimates of the lipids weighted associations with CHD stratified by functionally of the included variants.**
Lipids data were used from the GLGC, and linked to coronary heart disease data from CardiogramPlusC4D. Pairwise LD remaining (after clumping on R-squared of 0.60) was accounted for using the 1000 genomes ‘EUR’ reference panel and a (GLS) generalised least squares method. Estimates for *PCSK9, HMGCR*, and *NPC1L1* are given per SD in LDL-C, *CETP* estimates per HDL-C reflecting the likely effectiveness pathway to CHD. Estimates are given as OR with 95%CI (vertical error bars). Numerical details, including the number of variants used, are provided in Supplementary Tables 6–9.

**Fig. 5. Mendelian randomisation estimates of protein level effects on CHD, with a grid of LD threshold.**
Pairwise LD was accounted for using the 1000 genomes ‘EUR’ reference panel and a (GLS) generalised least squares method with or without Egger correction for possible horizontal pleiotropy. The number of included variants in the 1 mega base flanking region is depicted above the x-axis of the top panels. Estimates are given as OR with 95%CI (vertical error bars). The top panel depicts the variant to CHD or protein level effect for clumping threshold 0.5 for CETP (based on an Egger correct GLS model), and at 0.4 for PCSK9 using an IVW GLS model; 38 and 9 variants, respectively. Notice that the PCSK9 estimates were only available on the natural logarithmic scale.

**Fig. 6. A pQTL based drug target MR phenome wide scan.**
Results are presented as odds ratios with 95% confidence intervals (horizontal lines), positioned in the protein increasing direction. The total number of events and sample size are provided in the forest plot.

**Fig. 7. A proposed drug target MR analysis framework.**
The influence of LD and genetic region can be explored (and optimized) through simple grid-searching. Robustness of model choices in LD reference panel, the selection of functional or regulatory variants, and outlying or influential (high leverage) variants can be explored thorugh sensitivity analyses showcased here and in the supplementary analyses.

See this image and copyright information in PMC

References

1. Hingorani A, Humphries S. Nature’s randomised trials. Lancet. 2005;366:1906–1908. - PubMed
1. Swerdlow DI, et al. The interleukin-6 receptor as a target for prevention of coronary heart disease: a mendelian randomisation analysis. Lancet. 2012;379:1214–1224. - PMC - PubMed
1. Sarwar N, Butterworth AS. Interleukin-6 receptor pathways in coronary heart disease: a collaborative meta-analysis of 82 studies. Lancet. 2012;379:1205–1213. - PMC - PubMed
1. Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature (2018) 10.1038/s41586-018-0175-2. - PMC - PubMed
1. Xu X, et al. Molecular insights into genome-wide association studies of chronic kidney disease-defining traits. Nat. Commun. 2018;9:4800. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Genetic drug target validation using Mendelian randomisation

Affiliations

Genetic drug target validation using Mendelian randomisation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical