Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 10;150(10):455.
doi: 10.1007/s00432-024-05968-z.

RNA-Seq analysis for breast cancer detection: a study on paired tissue samples using hybrid optimization and deep learning techniques

Affiliations

RNA-Seq analysis for breast cancer detection: a study on paired tissue samples using hybrid optimization and deep learning techniques

Abrar Yaqoob et al. J Cancer Res Clin Oncol. .

Abstract

Problem: Breast cancer is a leading global health issue, contributing to high mortality rates among women. The challenge of early detection is exacerbated by the high dimensionality and complexity of gene expression data, which complicates the classification process.

Aim: This study aims to develop an advanced deep learning model that can accurately detect breast cancer using RNA-Seq gene expression data, while effectively addressing the challenges posed by the data's high dimensionality and complexity.

Methods: We introduce a novel hybrid gene selection approach that combines the Harris Hawk Optimization (HHO) and Whale Optimization (WO) algorithms with deep learning to improve feature selection and classification accuracy. The model's performance was compared to five conventional optimization algorithms integrated with deep learning: Genetic Algorithm (GA), Artificial Bee Colony (ABC), Cuckoo Search (CS), and Particle Swarm Optimization (PSO). RNA-Seq data was collected from 66 paired samples of normal and cancerous tissues from breast cancer patients at the Jawaharlal Nehru Cancer Hospital & Research Centre, Bhopal, India. Sequencing was performed by Biokart Genomics Lab, Bengaluru, India.

Results: The proposed model achieved a mean classification accuracy of 99.0%, consistently outperforming the GA, ABC, CS, and PSO methods. The dataset comprised 55 female breast cancer patients, including both early and advanced stages, along with age-matched healthy controls.

Conclusion: Our findings demonstrate that the hybrid gene selection approach using HHO and WO, combined with deep learning, is a powerful and accurate tool for breast cancer detection. This approach shows promise for early detection and could facilitate personalized treatment strategies, ultimately improving patient outcomes.

Keywords: Breast cancer; Deep learning; Harris Hawk algorithm; Whale optimization algorithm.

PubMed Disclaimer

Conflict of interest statement

The author confirms that they have no conflicts of interest to disclose.

Figures

Fig. 1
Fig. 1
Workflow of the steps followed in the study
Fig. 2
Fig. 2
Depicts the systematic process employed for conducting a literature review
Fig. 3
Fig. 3
Publication from the year 2015–2024 based on breast cancer with deep learning, machine learning
None
Algorithm 1
Fig. 4
Fig. 4
Deep learning configuration
Fig. 5
Fig. 5
The fitness graph of the proposed model
Fig. 6
Fig. 6
The prediction graph of proposed model
Fig. 7
Fig. 7
The energy value graph of the proposed algorithm
Fig. 8
Fig. 8
Confusion matrix for a HHWO, b HHO, c HHO and d PSO
Fig. 9
Fig. 9
Confusion matrix for a ABC, b GA, c CS and d SSA
Fig. 10
Fig. 10
ROC for deep learning model optimized by HHWO, WOA, and HHO
Fig. 11
Fig. 11
ROC curve of comparison algorithms
Fig. 12
Fig. 12
ROC Curve of different classifiers including SVM, KNN, DT, and NB
Fig. 13
Fig. 13
Comparison histogram based on Table 11

Similar articles

Cited by

References

    1. Abdi MJ, Hosseini SM, Rezghi M (2012) A novel weighted support vector machine based on particle swarm optimization for gene selection and tumor classification. Comput Math Methods Med 2012:1–7. 10.1155/2012/320698 - PMC - PubMed
    1. Abdollahzadeh B et al (2024) Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning. Cluster Comput 27(4):5235–5283. 10.1007/s10586-023-04221-5
    1. Agrawal P, Abutarboush HF, Ganesh T, Mohamed AW (2021) Metaheuristic algorithms on feature selection: a survey of one decade of research (2009–2019). IEEE Access 9:26766–26791. 10.1109/ACCESS.2021.3056407
    1. Alabool HM, Alarabiat D, Abualigah L, Heidari AA (2021) Harris hawks optimization: a comprehensive review of recent variants and applications. Neural Comput Appl 33(15):8939–8980. 10.1007/s00521-021-05720-5
    1. Algamal ZY, Lee MH (2015) Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification. Comput Biol Med 67:136–145. 10.1016/j.compbiomed.2015.10.008 - PubMed

LinkOut - more resources