Integration strategies of multi-omics data for machine learning analysis
- PMID: 34285775
- PMCID: PMC8258788
- DOI: 10.1016/j.csbj.2021.06.030
Abstract
Increased availability of high-throughput technologies has generated an ever-growing volume of omics data that seek to portray many different but complementary biological layers, including genomics, epigenomics, transcriptomics, proteomics, and metabolomics. New insights from these data have been obtained by machine learning algorithms that have produced diagnostic and classification biomarkers. Most biomarkers obtained to date, however, include only one omic measurement at a time and thus do not take full advantage of recent multi-omics experiments that now capture the entire complexity of biological systems. Multi-omics data integration strategies are needed to combine the complementary knowledge brought by each omics layer. We have summarized the most recent data integration methods/frameworks into five different integration strategies: early, mixed, intermediate, late and hierarchical. In this mini-review, we focus on challenges and existing multi-omics integration strategies, paying special attention to machine learning applications.
Keywords: Deep learning; Integration strategy; Machine learning; Multi-omics; Multi-view; Network.
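As an illustration of two of the five strategies named in the abstract, the sketch below contrasts early and late integration on toy data. The abstract does not define the strategies, so the code assumes the common reading: early integration concatenates the omics blocks into one feature matrix before fitting a single model, and late integration fits one model per block and then combines their outputs. The nearest-centroid classifier, the data values, and all function names are hypothetical choices made for this example, not methods taken from the review.

```python
import math

def centroids(X, y):
    """Return {label: mean feature vector} for a nearest-centroid classifier."""
    out = {}
    for label in sorted(set(y)):
        rows = [x for x, t in zip(X, y) if t == label]
        out[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return out

def score(model, x):
    """Positive when x is closer to the class-1 centroid than to class 0."""
    return math.dist(x, model[0]) - math.dist(x, model[1])

# Hypothetical toy data: two omics blocks measured on the same 4 samples.
transcriptomics = [[1.0, 0.9], [1.1, 1.0], [3.0, 3.2], [2.9, 3.1]]
metabolomics = [[0.2], [0.1], [0.9], [1.0]]
labels = [0, 0, 1, 1]
blocks = [transcriptomics, metabolomics]

def early_fit(blocks, y):
    # Early integration (assumed definition): concatenate the feature
    # vectors of every block per sample, then fit a single model.
    X = [sum(rows, []) for rows in zip(*blocks)]
    return centroids(X, y)

def early_predict(model, sample_blocks):
    return int(score(model, sum(sample_blocks, [])) > 0)

def late_fit(blocks, y):
    # Late integration (assumed definition): fit one model per block.
    return [centroids(X, y) for X in blocks]

def late_predict(models, sample_blocks):
    # Combine the per-block models by averaging their scores.
    scores = [score(m, x) for m, x in zip(models, sample_blocks)]
    return int(sum(scores) / len(scores) > 0)

new_sample = [[2.8, 3.0], [0.8]]  # one feature vector per omics block
early_pred = early_predict(early_fit(blocks, labels), new_sample)
late_pred = late_predict(late_fit(blocks, labels), new_sample)
print(early_pred, late_pred)
```

In this sketch the two strategies differ only in where the blocks are merged: before model fitting (early) or after per-block scoring (late). Real multi-omics pipelines would also need per-block scaling and handling of missing samples, which the toy example leaves out.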
© 2021 The Author(s).
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.