Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 1;10(3):e25369.
doi: 10.1016/j.heliyon.2024.e25369. eCollection 2024 Feb 15.

A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction

Affiliations

A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction

Erum Yousef Abbasi et al. Heliyon. .

Abstract

In recent years, scientific data on cancer has expanded, providing potential for a better understanding of malignancies and improved tailored care. Advances in Artificial Intelligence (AI) processing power and algorithmic development position Machine Learning (ML) and Deep Learning (DL) as crucial players in predicting Leukemia, a blood cancer, using integrated multi-omics technology. However, realizing these goals demands novel approaches to harness this data deluge. This study introduces a novel Leukemia diagnosis approach, analyzing multi-omics data for accuracy using ML and DL algorithms. ML techniques, including Random Forest (RF), Naive Bayes (NB), Decision Tree (DT), Logistic Regression (LR), Gradient Boosting (GB), and DL methods such as Recurrent Neural Networks (RNN) and Feedforward Neural Networks (FNN) are compared. GB achieved 97 % accuracy in ML, while RNN outperformed by achieving 98 % accuracy in DL. This approach filters unclassified data effectively, demonstrating the significance of DL for leukemia prediction. The testing validation was based on 17 different features such as patient age, sex, mutation type, treatment methods, chromosomes, and others. Our study compares ML and DL techniques and chooses the best technique that gives optimum results. The study emphasizes the implications of high-throughput technology in healthcare, offering improved patient care.

Keywords: Deep learning; Genomics; Leukemia; Machine learning; Multi-omics.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Overview of omics technologies.
Fig. 2
Fig. 2
Example of artificial neural networks.
Fig. 3
Fig. 3
A. Correlation coefficient 0 B. Correlation coefficient −1.
Fig. 4
Fig. 4
Correlation Coefficient with a positive relation.
Fig. 5
Fig. 5
Proposed methodology.
Fig. 6
Fig. 6
Flow chart of research study.
Fig. 7
Fig. 7
A. Correlation coefficient (part 1). B. Correlation coefficient (part 2).
Fig. 7
Fig. 7
A. Correlation coefficient (part 1). B. Correlation coefficient (part 2).
Fig. 8
Fig. 8
A. Consolidated features (part 1). B. Consolidated features (part 2). C. consolidated features (part 3).
Fig. 8
Fig. 8
A. Consolidated features (part 1). B. Consolidated features (part 2). C. consolidated features (part 3).
Fig. 8
Fig. 8
A. Consolidated features (part 1). B. Consolidated features (part 2). C. consolidated features (part 3).
Fig. 9
Fig. 9
Combined ROC curves for classification models.
Fig. 10
Fig. 10
A. Confusion matrix for GB; B. Confusion matrix for NB.
Fig. 11
Fig. 11
A. Confusion matrix for RF; B. Confusion matrix for DT.
Fig. 12
Fig. 12
Confusion matrix for LR.
Fig. 13
Fig. 13
A. Mse loss for FNN; B. Bce loss for FNN.
Fig. 14
Fig. 14
A. Bce loss for RNN; B. Mse loss for RNN.
Fig. 15
Fig. 15
A. Bce loss for FNN; B. Mse loss for FNN.
Fig. 16
Fig. 16
A. Mse loss for RNN; B. Bce loss for RNN.
Fig. 17
Fig. 17
A. RNN with MSE using Softmax Function; B. RNN with BCE using Softmax Function.
Fig. 18
Fig. 18
A. FNN with BCE using Softmax Function; B. FNN with MSE using Softmax Function.
Fig. 19
Fig. 19
A. FNN with MSE using Relu Function; B. FNN BCE Loss using Relu Function.
Fig. 20
Fig. 20
A. RNN with MSE using Relu Activation Function; B. RNN with BCE using Relu Activation Function.
Fig. 21
Fig. 21
Overall performance of DNNs.
Fig. 22
Fig. 22
Comparison of ML and DL techniques.

References

    1. Araaf M.A., Nugroho K., Setiadi D.R.I.M. Comprehensive analysis and classification of skin diseases based on image texture features using K-nearest neighbors algorithm. J. Comput. Theories Appl. Sep. 2023;1(1):31–40. doi: 10.33633/jcta.v1i1.9185. - DOI
    1. Picard M., Scott-Boyer M.-P., Bodein A., Périn O., Droit A. Integration strategies of multi-omics data for machine learning analysis. Comput. Struct. Biotechnol. J. 2021;19:3735–3746. doi: 10.1016/j.csbj.2021.06.030. - DOI - PMC - PubMed
    1. Santiago-Rodriguez T.M., Hollister E.B. Multi ‘omic data integration: a review of concepts, considerations, and approaches. Semin. Perinatol. Oct. 2021;45(6) doi: 10.1016/j.semperi.2021.151456. - DOI - PubMed
    1. Zheng J., et al. Integrative analysis of multi-omics identified the prognostic biomarkers in acute myelogenous leukemia. Front. Oncol. 2020;10(Dec) doi: 10.3389/fonc.2020.591937. - DOI - PMC - PubMed
    1. Chu X., Zhang B., Koeken V.A.C.M., Gupta M.K., Li Y. Multi-omics approaches in immunological research. Front. Immunol. 2021;12 doi: 10.3389/fimmu.2021.668045. Frontiers Media S.A. - DOI - PMC - PubMed

LinkOut - more resources