A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling
- PMID: 35604554
- DOI: 10.1007/978-1-0716-1831-8_5
A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling
Abstract
Complex, distributed, and dynamic sets of clinical biomedical data are collectively referred to as multimodal clinical data. In order to accommodate the volume and heterogeneity of such diverse data types and aid in their interpretation when they are combined with a multi-scale predictive model, machine learning is a useful tool that can be wielded to deconstruct biological complexity and extract relevant outputs. Additionally, genome-scale metabolic models (GSMMs) are one of the main frameworks striving to bridge the gap between genotype and phenotype by incorporating prior biological knowledge into mechanistic models. Consequently, the utilization of GSMMs as a foundation for the integration of multi-omic data originating from different domains is a valuable pursuit towards refining predictions. In this chapter, we show how cancer multi-omic data can be analyzed via multimodal machine learning and metabolic modeling. Firstly, we focus on the merits of adopting an integrative systems biology led approach to biomedical data mining. Following this, we propose how constraint-based metabolic models can provide a stable yet adaptable foundation for the integration of multimodal data with machine learning. Finally, we provide a step-by-step tutorial for the combination of machine learning and GSMMs, which includes: (i) tissue-specific constraint-based modeling; (ii) survival analysis using time-to-event prediction for cancer; and (iii) classification and regression approaches for multimodal machine learning. The code associated with the tutorial can be found at https://github.com/Angione-Lab/Tutorials_Combining_ML_and_GSMM .
Keywords: Cancer survival prediction; Data integration; Flux balance analysis; Machine learning; Metabolic modeling; Multi-omics; Multimodal.
© 2022. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.
Similar articles
-
Machine learning for the advancement of genome-scale metabolic modeling.Biotechnol Adv. 2024 Sep;74:108400. doi: 10.1016/j.biotechadv.2024.108400. Epub 2024 Jun 27. Biotechnol Adv. 2024. PMID: 38944218 Review.
-
A mechanism-aware and multiomic machine-learning pipeline characterizes yeast cell growth.Proc Natl Acad Sci U S A. 2020 Aug 4;117(31):18869-18879. doi: 10.1073/pnas.2002959117. Epub 2020 Jul 16. Proc Natl Acad Sci U S A. 2020. PMID: 32675233 Free PMC article.
-
A Hybrid Flux Balance Analysis and Machine Learning Pipeline Elucidates Metabolic Adaptation in Cyanobacteria.iScience. 2020 Nov 18;23(12):101818. doi: 10.1016/j.isci.2020.101818. eCollection 2020 Dec 18. iScience. 2020. PMID: 33354660 Free PMC article.
-
Machine and deep learning meet genome-scale metabolic modeling.PLoS Comput Biol. 2019 Jul 11;15(7):e1007084. doi: 10.1371/journal.pcbi.1007084. eCollection 2019 Jul. PLoS Comput Biol. 2019. PMID: 31295267 Free PMC article. Review.
-
Optimization of Multi-Omic Genome-Scale Models: Methodologies, Hands-on Tutorial, and Perspectives.Methods Mol Biol. 2018;1716:389-408. doi: 10.1007/978-1-4939-7528-0_18. Methods Mol Biol. 2018. PMID: 29222764 Review.
Cited by
-
The Transformative Role of Artificial Intelligence in Dentistry: A Comprehensive Overview. Part 1: Fundamentals of AI, and its Contemporary Applications in Dentistry.Int Dent J. 2025 Apr;75(2):383-396. doi: 10.1016/j.identj.2025.02.005. Epub 2025 Mar 11. Int Dent J. 2025. PMID: 40074616 Free PMC article. Review.
References
-
- Shi Y, Kim S (2014) Towards information analysis for big data. In: 2014 7th conference on Control and automation (CA). IEEE, Piscataway, pp 3–5
-
- Gupta A (2015) Big data analysis using computational intelligence and Hadoop: a study. In: 2015 2nd international conference on computing for sustainable global development (INDIACom). IEEE, Piscataway, pp 1397–1401
-
- Ceri S, Kaitoua A, Masseroli M, Pinoli P, Venco F (2016) Data management for heterogeneous genomic datasets. IEEE/ACM Trans Comput Biol Bioinform 14(6):1251–1264 - PubMed
-
- Kench A, Janeja VP, Yesha Y, Rishe N, Grasso MA, Niskar A (2015) Clinico-genomic data analytics for precision diagnosis and disease management. In: 2015 international conference on healthcare informatics (ICHI). IEEE, Piscataway, pp 263–271
-
- Zieba A, Grannas K, Söderberg O, Gullberg M, Nilsson M, Landegren U (2012) Molecular tools for companion diagnostics. New Biotechnol 29(6):634–640
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous