This is a preprint.
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration
- PMID: 38260498
- PMCID: PMC10802464
- DOI: 10.1101/2024.01.09.574780
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration
Update in
-
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.PLoS Comput Biol. 2024 Mar 25;20(3):e1011814. doi: 10.1371/journal.pcbi.1011814. eCollection 2024 Mar. PLoS Comput Biol. 2024. PMID: 38527092 Free PMC article.
Abstract
As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. The PathIntegrate Python package is available at https://github.com/cwieder/PathIntegrate.
Figures






Similar articles
-
PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration.PLoS Comput Biol. 2024 Mar 25;20(3):e1011814. doi: 10.1371/journal.pcbi.1011814. eCollection 2024 Mar. PLoS Comput Biol. 2024. PMID: 38527092 Free PMC article.
-
DeepKEGG: a multi-omics data integration framework with biological insights for cancer recurrence prediction and biomarker discovery.Brief Bioinform. 2024 Mar 27;25(3):bbae185. doi: 10.1093/bib/bbae185. Brief Bioinform. 2024. PMID: 38678587 Free PMC article.
-
MUMA: a multi-omics meta-learning algorithm for data interpretation and classification.IEEE J Biomed Health Inform. 2024 Feb 12;PP. doi: 10.1109/JBHI.2024.3363081. Online ahead of print. IEEE J Biomed Health Inform. 2024. PMID: 38345953
-
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
-
Multimodal deep learning approaches for single-cell multi-omics data integration.Brief Bioinform. 2023 Sep 20;24(5):bbad313. doi: 10.1093/bib/bbad313. Brief Bioinform. 2023. PMID: 37651607 Free PMC article. Review.
References
-
- Canzler S. et al. Prospects and challenges of multi-omics data integration in toxicology. Arch Toxicol 94, 371–388 (2020). - PubMed
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials