Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 5;35(9):1646-1660.e3.
doi: 10.1016/j.cmet.2023.07.012. Epub 2023 Aug 14.

Protein-metabolite association studies identify novel proteomic determinants of metabolite levels in human plasma

Affiliations

Protein-metabolite association studies identify novel proteomic determinants of metabolite levels in human plasma

Mark D Benson et al. Cell Metab. .

Abstract

Although many novel gene-metabolite and gene-protein associations have been identified using high-throughput biochemical profiling, systematic studies that leverage human genetics to illuminate causal relationships between circulating proteins and metabolites are lacking. Here, we performed protein-metabolite association studies in 3,626 plasma samples from three human cohorts. We detected 171,800 significant protein-metabolite pairwise correlations between 1,265 proteins and 365 metabolites, including established relationships in metabolic and signaling pathways such as the protein thyroxine-binding globulin and the metabolite thyroxine, as well as thousands of new findings. In Mendelian randomization (MR) analyses, we identified putative causal protein-to-metabolite associations. We experimentally validated top MR associations in proof-of-concept plasma metabolomics studies in three murine knockout strains of key protein regulators. These analyses identified previously unrecognized associations between bioactive proteins and metabolites in human plasma. We provide publicly available data to be leveraged for studies in human metabolism and disease.

Keywords: GWAS; functional genomics; genome-wide association study; genomics; metabolomics; multi-omics; pathway discovery; proteomics.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests M.D.B. reports investigator-initiated grants from Amgen and Boehringer-Ingelheim. P.N. reports investigator-initiated grants from Amgen, Apple, AstraZeneca, Boston Scientific, and Novartis and personal fees from Apple, AstraZeneca, Blackstone Life Sciences, Foresite Labs, Novartis, Roche/Genentech; is a co-founder of TenSixteen Bio; is a shareholder of geneXwell, TenSixteen Bio, and Vertex; is a scientific advisory board member of geneXwell and TenSixteen Bio; and reports spousal employment at Vertex, all unrelated to the present work.

Figures

Figure 1.
Figure 1.. The integration of human plasma proteomic, metabolomic, and genomic profiling data for pathway discovery.
Flow diagram detailing the experimental pipeline and main results from the integration of plasma proteomic, metabolomic, and genomic profiling datasets in the Jackson Heart Study (JHS), Multi-Ethnic Study of Atherosclerosis (MESA), and Health, Risk Factors, Exercise Training and Genetics Study (HERITAGE Family study). WGAS = whole genome association study, GWAS = genome wide association study.
Figure 2.
Figure 2.. Protein-metabolite correlations in human plasma.
Pearson correlation coefficients were calculated for every pairwise protein-metabolite combination within the JHS, MESA, and HERITAGE Family study using age- and sex-adjusted, log-normalized, and standardized protein and metabolite levels. (A) A subset of proteins and metabolites were positively (red) or negatively (blue) correlated with an FDR-adjusted q-value ≤ 0.05 in each study, and in a meta-analysis of the three studies. (B) Ninety-seven percent of the meta-analyzed age- and sex-adjusted protein-metabolite correlations remained significant with an FDR-adjusted q-value ≤0.05 when further adjusted for BMI, and 87% remained significant when additionally adjusted for eGFR. (C) The magnitude and directionality of the meta-analyzed age- and sex-adjusted protein-metabolite correlations were consistent across studies. (D) Visualization of the protein-metabolite correlations using a heat map demonstrated distinct patterns of associations between individual proteins and members of specific classes of metabolites. Individual metabolites were ordered according to RefMet class along the y-axis, and proteins were allowed to order using a hierarchical cluster analysis along the x-axis. The magnitude and directionality of the correlation coefficient for each protein-metabolite association is depicted by color, as indicated in the legend. (E) The statistical significance (−log(p-value)) of the correlation between each metabolite and four representative protein hormones is shown.
Figure 3.
Figure 3.. Protein correlations are significantly enriched for specific metabolite classes.
(A) A volcano plot demonstrates that the most significantly correlated metabolites with plasma levels of APOE protein were members of the lipid metabolite class. (B) To evaluate this enrichment for lipids more quantitatively, an enrichment score (ES) was computed and visualized as the maximum point (marked with ●) of a running sum statistic that increases proportionally with each lipid and decreases with non-lipids along the ranked list of metabolite correlations with plasma levels of APOE. (C) 241 proteins were significantly enriched for correlations with plasma lipids (FDR-adjusted q-value ≤ 0.05). In the top panel, the ES tracings are shown for each individual protein, and the ten proteins with the highest enrichment scores are listed. In the bottom panel, proteins are ordered in descending order of calculated enrichment scores (x-axis), and the number of proteins significantly enriched for correlations with lipids (FDR-adjusted q-value ≤ 0.05) is indicated with the vertical red line. Similar enrichment analyses are shown for Amino and Organic Acids (D), Carbohydrates (E), and Nucleic Acids (F). The complete enrichment analysis dataset is available in Supplemental Table 5.
Figure 4.
Figure 4.. Mendelian Randomization analyses identify putative causal protein-to-metabolite associations in humans.
Proteins with at least one pQTL in cis (located within 1Mb) of the protein coding gene that could be used as an instrumental variable (IV) in Mendelian Randomization (MR) analyses are depicted in blue (a). The proteins with available cis instruments were distributed evenly across the genome (b). pQTLs used in IVs were generally located near the transcriptional start site (TSS) of the protein coding gene (c). A volcano plot depicts the 224 significant MR associations between 52 proteins and 146 metabolites with an FDR-adjusted q-value ≤ 0.05 (d). The three proteins with the most significant MR metabolite associations are depicted by distinct shapes described in the figure legend.
Figure 5.
Figure 5.. Protein-to-metabolite causal associations predicted by Mendelian Randomization analyses in humans experimentally validate in murine knockout models.
Plasma metabolomics was conducted on C57BL/6 murine knockout (KO) strains for CD36 (n=6), APOE (n=6), and ACY1 (n=6) and compared to wild type (WT) controls (n=8). Scatterplots depict the number of predicted protein-to-metabolite MR associations in humans (with q≤0.1) that validated in each murine model (with p≤0.05, highlighted in color), as well as the concordance in directionality of these associations (a, c, e). The position on the x axis represents the p-value of the predicted MR association between each protein and metabolite level in the human studies. The position on the y axis represents the p-value of the difference in metabolite level between KO and WT mice. Metabolites on the right half of the scatterplots are predicted to be positively associated with each protein by MR, whereas metabolites on the left half are predicted to be inversely associated with each protein. Similarly, metabolites on the top half of the scatterplots were higher in KO vs WT animals, whereas metabolites on the bottom half were lower in KO vs. WT animals. The northwest and southeast quadrants of each scatter plot are shaded to depict consistent directionality between the MR predictions and KO experiments. The levels of each metabolite that were significantly different in KO vs. WT animals (p≤0.05) are shown as fold changes compared to WT animals (b, d, f). * indicates P<0.05, and ** indicates P<0.01 by students two tailed t-test.

Similar articles

Cited by

References

    1. Gieger C, Geistlinger L, Altmaier E, Hrabé de Angelis M, Kronenberg F, Meitinger T, Mewes HW, Wichmann HE, Weinberger KM, Adamski J, et al. (2008). Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum. PLoS Genet 4, e1000282. 10.1371/journal.pgen.1000282. - DOI - PMC - PubMed
    1. Illig T, Gieger C, Zhai G, Römisch-Margl W, Wang-Sattler R, Prehn C, Altmaier E, Kastenmüller G, Kato BS, Mewes HW, et al. (2010). A genome-wide perspective of genetic variation in human metabolism. Nat Genet 42, 137–141. 10.1038/ng.507. - DOI - PMC - PubMed
    1. Suhre K, Shin SY, Petersen AK, Mohney RP, Meredith D, Wägele B, Altmaier E, Deloukas P, Erdmann J, Grundberg E, et al. (2011). Human metabolic individuality in biomedical and pharmaceutical research. Nature 477, 54–60. 10.1038/nature10354. - DOI - PMC - PubMed
    1. Kettunen J, Tukiainen T, Sarin AP, Ortega-Alonso A, Tikkanen E, Lyytikäinen LP, Kangas AJ, Soininen P, Würtz P, Silander K, et al. (2012). Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nat Genet 44, 269–276. 10.1038/ng.1073. - DOI - PMC - PubMed
    1. Rhee EP, Ho JE, Chen MH, Shen D, Cheng S, Larson MG, Ghorbani A, Shi X, Helenius IT, O’Donnell CJ, et al. (2013). A genome-wide association study of the human metabolome in a community-based cohort. Cell Metab 18, 130–143. 10.1016/j.cmet.2013.06.013. - DOI - PMC - PubMed

Publication types

Grants and funding