Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Jun 30:10:1030.
doi: 10.3389/fonc.2020.01030. eCollection 2020.

Integrated Multi-Omics Analyses in Oncology: A Review of Machine Learning Methods and Tools

Affiliations
Review

Integrated Multi-Omics Analyses in Oncology: A Review of Machine Learning Methods and Tools

Giovanna Nicora et al. Front Oncol. .

Abstract

In recent years, high-throughput sequencing technologies provide unprecedented opportunity to depict cancer samples at multiple molecular levels. The integration and analysis of these multi-omics datasets is a crucial and critical step to gain actionable knowledge in a precision medicine framework. This paper explores recent data-driven methodologies that have been developed and applied to respond major challenges of stratified medicine in oncology, including patients' phenotyping, biomarker discovery, and drug repurposing. We systematically retrieved peer-reviewed journals published from 2014 to 2019, select and thoroughly describe the tools presenting the most promising innovations regarding the integration of heterogeneous data, the machine learning methodologies that successfully tackled the complexity of multi-omics data, and the frameworks to deliver actionable results for clinical practice. The review is organized according to the applied methods: Deep learning, Network-based methods, Clustering, Features Extraction, and Transformation, Factorization. We provide an overview of the tools available in each methodological group and underline the relationship among the different categories. Our analysis revealed how multi-omics datasets could be exploited to drive precision oncology, but also current limitations in the development of multi-omics data integration.

Keywords: cancer; machine learning; multi-omics; oncology; systematic review; tools.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(A) Linkage between different methodological categories. References to papers (see Table 1). That could be categorized in different groups are reported near the link. (B) Publications by year of publication and Field-Weighted Citation Impact. Different colors indicate exploited methods, shapes aims, and outputs. Papers with red borders have source code or provide a tool. Papers in the “Subgroup identification” group and/or with free tool result to be the most cited across years. The reference numbers are reported in Table 1.

References

    1. Shen R, Olshen AB, Ladanyi M. Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis. Bioinformatics. (2009) 25:2906–12. 10.1093/bioinformatics/btp543 - DOI - PMC - PubMed
    1. Lock EF, Hoadley KA, Marron JS, Nobel AB. Joint and individual variation explained (JIVE) for integrated analysis of multiple data types. Ann Appl Stat. (2013) 7:523–42. 10.1214/12-AOAS597 - DOI - PMC - PubMed
    1. List M, Hauschild A-C, Tan Q, Kruse TA, Mollenhauer J, Baumbach J, et al. . Classification of breast cancer subtypes by combining gene expression and DNA methylation data. J Integr Bioinform. (2014) 11:236. 10.2390/biecoll-jib-2014-236 - DOI - PubMed
    1. Ray P, Zheng L, Lucas J, Carin L. Bayesian joint analysis of heterogeneous genomics data. Bioinformatics. (2014) 30:1370–6. 10.1093/bioinformatics/btu064 - DOI - PubMed
    1. Gligorijević V, Malod-Dognin N, PrŽulj N. Patient-specific data fusion for cancer stratification and personalised treatment. Pacific Symp Biocomput. (2016) 21:321–332. 10.1142/9789814749411_0030 - DOI - PubMed

LinkOut - more resources