Automated pathway and reaction prediction facilitates in silico identification of unknown metabolites in human cohort studies
- PMID: 28479069
- DOI: 10.1016/j.jchromb.2017.04.002
Abstract
Identification of metabolites in non-targeted metabolomics continues to be a bottleneck in metabolomics studies in large human cohorts. Unidentified metabolites frequently emerge in the results of association studies linking metabolite levels to, for example, clinical phenotypes. For further analyses, these unknown metabolites must be identified. Current approaches utilize chemical information, such as spectral details and fragmentation characteristics, to determine components of unknown metabolites. Here, we propose a systems biology model that exploits the internal correlation structure of metabolite levels, in combination with existing biochemical and genetic information, to characterize properties of unknown molecules. Levels of 758 metabolites (439 known, 319 unknown) in human blood samples of 2279 subjects were measured using a non-targeted metabolomics platform (LC-MS and GC-MS). We reconstructed the structure of biochemical pathways that are imprinted in these metabolomics data by building an empirical network model based on 1040 significant partial correlations between metabolites. To this network model we further added associations of these metabolites to 134 genes from genome-wide association studies, as well as reactions and functional relations to genes from the public database Recon 2. From the local neighborhood in the network, we were able to predict the pathway annotation of 180 unknown metabolites. Furthermore, based on their mass differences, we assigned 100 pairs of known and unknown metabolites and 45 pairs of unknown metabolites to 21 types of reactions. As a proof of concept, we then looked further into the special case of predicted dehydrogenation reactions, which led us to select 39 candidate molecules for 5 unknown metabolites. Finally, we could verify 2 of those candidates by LC-MS analyses of commercially available candidate substances. The formerly unknown metabolites X-13891 and X-13069 were shown to be 2-dodecendioic acid and 9-tetradecenoic acid, respectively. Our data-driven approach, based on measured metabolite levels and genetic associations as well as information from public resources, can be used alone or together with methods utilizing spectral patterns as a complementary, automated and powerful method to characterize unknown metabolites.
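The mass-difference step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three reaction types, the tolerance of 0.005 Da, and the names `REACTION_MASS_SHIFTS`, `monoisotopic_mass` and `classify_pair` are assumptions made for illustration, since the abstract lists neither the 21 reaction types nor the matching tolerance. The example pair (tetradecanoic acid and 9-tetradecenoic acid) is likewise chosen only to show a dehydrogenation-sized shift and is not claimed to be the pair used in the study.

```python
# Monoisotopic masses (Da) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}

# ASSUMPTION: a small illustrative subset of reaction types; the abstract
# does not enumerate the 21 reaction types used in the study.
REACTION_MASS_SHIFTS = {
    "dehydrogenation (-H2)": 2 * MONOISOTOPIC["H"],
    "methylation (+CH2)": MONOISOTOPIC["C"] + 2 * MONOISOTOPIC["H"],
    "hydroxylation (+O)": MONOISOTOPIC["O"],
}


def monoisotopic_mass(composition):
    """Sum element masses for a composition such as {"C": 14, "H": 28, "O": 2}."""
    return sum(MONOISOTOPIC[element] * count for element, count in composition.items())


def classify_pair(mass_a, mass_b, tolerance=0.005):
    """Return the reaction labels whose mass shift matches |mass_a - mass_b|.

    ASSUMPTION: an absolute tolerance in Da; the study's criterion is not
    stated in the abstract.
    """
    delta = abs(mass_a - mass_b)
    return [
        name
        for name, shift in REACTION_MASS_SHIFTS.items()
        if abs(delta - shift) <= tolerance
    ]


# Hypothetical pair differing by exactly two hydrogen atoms.
tetradecanoic_acid = monoisotopic_mass({"C": 14, "H": 28, "O": 2})
tetradecenoic_acid = monoisotopic_mass({"C": 14, "H": 26, "O": 2})
matches = classify_pair(tetradecanoic_acid, tetradecenoic_acid)
```

In this sketch, `matches` holds the labels of every reaction type compatible with the observed mass difference, so a pair can match zero, one, or several types depending on the tolerance. In practice the comparison would run over pairs that are adjacent in the correlation network rather than over all metabolite pairs.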
Keywords: Biochemical pathway prediction; Metabolic network reconstruction; Metabolite identification; Non-targeted metabolomics; Reaction prediction.
Copyright © 2017. Published by Elsevier B.V.