Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct;52(10):1122-1131.
doi: 10.1038/s41588-020-0682-6. Epub 2020 Sep 7.

Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases

Affiliations

Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases

Jie Zheng et al. Nat Genet. 2020 Oct.

Abstract

The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma protein levels. Here we estimated the effects of 1,002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization. Of 413 associations supported by evidence from MR, 130 (31.5%) were not supported by results of colocalization analyses, suggesting that genetic confounding due to linkage disequilibrium is widespread in naïve phenome-wide association studies of proteins. Combining MR and colocalization evidence in cis-only analyses, we identified 111 putatively causal effects between 65 proteins and 52 disease-related phenotypes ( https://www.epigraphdb.org/pqtl/ ). Evaluation of data from historic drug development programs showed that target-indication pairs with MR and colocalization support were more likely to be approved, evidencing the value of this approach in identifying and prioritizing potential therapeutic targets.

PubMed Disclaimer

Conflict of interest statement

Competing Interests Statement

A. G., L. M., M. R. H., D. W., M. R. N., R. S., and R. A. S.are employees and shareholders in GlaxoSmithKline. H. R., J. Z. L., and K. E. are employees and shareholders in Biogen. J. Z and V. H. is employed on a grant funded by GlaxoSmithKline. D. B. is employed on a grant funded by Biogen. T. R. G., G. H., and G. D. S. receive funding from GlaxoSmithKline and Biogen for the work described here. A. S. B. has received grants from Merck, Novartis, Biogen, Pfizer and AstraZeneca. M. V. H. has collaborated with Boehringer Ingelheim in research, and in accordance with the policy of the Clinical Trial Service Unit and Epidemiological Studies Unit (University of Oxford), did not accept any personal payment.

This work was supported by Health Data Research UK, which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome.

Figures

Figure 1
Figure 1. Study design of this phenome-wide MR study of the plasma proteome.
The study included instrument selection and validation, outcome selection, four types of MR analyses, colocalization, sensitivity analyses, and drug target validation.
Figure 2
Figure 2. A demonstration of pairwise conditional and colocalization (PWCoCo) analysis.
Assume there are two conditional independent association pQTL signals (SNP 1 and SNP 2) and two conditional independent outcome signals (SNP 1 and SNP3) in the tested region. A naïve colocalization analysis using marginal association statistics will return weak evidence of colocalization (showed in regional plots A and D). By conducting the analyses conditioning on SNP 2 (plot B) and 1 (plot C) for the pQTLs and conditioning on SNP 1 (plot E) and 3 (plot F) for the outcome phenotype, each of the ninepairwise combinations of pQTL and outcome association statistics (represented as lines with different colors in the middle of this figure) will be tested using colocalization. In this case, the combination of plot B and plot E shows evidence of colocalization but the remaining eightdo not.
Figure 3
Figure 3. Miami plot for the cis-only analysis, with circles representing the MR results for proteins on human phenotypes.
The labels refer to top MR findings with colocalization evidence, with each protein represented by one label. The color refers to top MR findings with P < 3.09 x 10-7, where red refers to immune-mediated phenotypes, blue refers to cardiovascularphenotypes, green refers to lung-related phenotypes, purple refers to bone phenotypes, orange refers to cancers, yellow refers to glycemic phenotypes, brown refers to psychiatric phenotypes, pink refers to other phenotypesand grey refers to phenotypes that showed less evidence of colocalization. The x-axis is the chromosome and position of each MR finding in the cis region. The y-axis is the -log10 P value of the MR findings, MR findings with positive effects (increased level of proteins associated with increasing the phenotype level) are represented by filled circles on the top of the Miami plot, while MR findings with negative effects (decreased level of proteins associated with increasing the phenotype level) are on the bottom of the Miami plot.
Figure 4
Figure 4. Regional association plots of IL23R plasma protein level and Crohn’s disease in theIL23R region.
a, b, Regional plots of IL23R protein level and Crohn’s disease without conditional analysis. Plot in b lists the sets of conditionally independent signals for Crohn’s disease in this region: rs7517847, rs7528924, rs183020189, rs7528804 (a proxy for the second IL23R hit rs3762318, r =0.42 in the 1000 Genome Europeans) and rs11209026 (a proxy for the top IL23R hit rs11581607, r =1 in the 1000 Genome Europeans), conditional P value < 1x10-7. c, Regional plot of IL23R with the joint SNP effects conditioned on the second hit (rs3762318) for IL23R. d, Regional plot of Crohn’s disease with the joint SNP effects adjusted for other independent signals except the top IL23R signal rs11581607. e, Regional plot of IL23R with the joint SNP effects conditioned on the top hit (rs11581607) for IL23R. f, Regional plot of Crohn’s disease with the joint SNP effects adjusted for other independent signals except the second IL23R signal rs3762318. The heatmap ofthe colocalization evidence for IL23R association on Crohn’s disease (CD) in the IL23R region is presented in Supplementary Figure 4.
Figure 5
Figure 5. Enrichment of phenome-wide MR of the plasma proteome with the druggable genome.
In this figure, we only show proteins with convincing MR and colocalization evidence with at least one of the 70 phenotypes. The x-axis shows the categories of 70 human phenotypes, where the phenotypes have been grouped into 8 categories: 8 autoimmune diseases (red), 3 bone phenotypes (purple), 8 cancers (orange), 12 cardiovascular phenotypes (blue), 4 glycemic phenotypes (yellow), 2 lung phenotypes (green), 4 psychiatric phenotypes (brown), and 29 other phenotypes (pink). The y-axis presents the tiers of the druggable genome (as defined by Finan et al.) of 120 proteins under analysis, where the proteins have been classified into 4 groups based on their druggability: tier 1 contains 23 proteins that are efficacy targets of approved small molecules and biotherapeutic drugs, tier 2 contains 11 proteins closely related to approved drug targets or with associated drug-like compounds, tier 3 contains58 secreted or extracellular proteins or proteins distantly related to approved drug targets, and 28 proteins have unknown druggable status (Unclassified). The cells with colors are protein-phenotype associations with strong MR and colocalization evidence. Cells in green are associations overlapping with the tier 1 druggable genome, while cells in yellow, red or purple were associations with tier 2, tier 3 or unclassified. More detailed information is shown in Supplementary Table 24.

References

    1. Plenge RM, Scolnick EM, Altshuler D. Validating therapeutic targets through human genetics. Nat Rev Drug Discov. 2013;12:581–594. - PubMed
    1. Hay M, Thomas DW, Craighead JL, Economides C, Rosenthal J. Clinical development success rates for investigational drugs. Nat Biotechnol. 2014;32:40–51. - PubMed
    1. Arrowsmith J, Miller P. Phase II and Phase III attrition rates 2011-2012. Nat Rev Drug Discov. 2013;12:569. - PubMed
    1. Harrison RK. Phase II and phase III failures: 2013-2015. Nat Rev Drug Discov. 2016;15:817. - PubMed
    1. Cummings JL, Morstorf T, Zhong K. Alzheimer’s disease drug-development pipeline: few candidates, frequent failures. Alzheimers Res Ther. 2014;6:37. - PMC - PubMed

Publication types