Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data
- PMID: 36233640
- PMCID: PMC9570670
- DOI: 10.3390/jcm11195772
Abstract
Background: It is important to be able to predict, for each individual patient, the likelihood of later metastatic occurrence, because the prediction can guide treatment plans tailored to that patient to prevent metastasis and to help avoid under-treatment or over-treatment. Deep neural network (DNN) learning, commonly referred to as deep learning, has become popular due to its success in image detection and prediction, but questions such as whether deep learning outperforms other machine learning methods when using non-image clinical data remain unanswered. Grid search has been introduced to deep learning hyperparameter tuning to improve prediction performance, but the effect of grid search on other machine learning methods is under-studied. In this research, we take an empirical approach to study the performance of deep learning and other machine learning methods when using non-image clinical data to predict the occurrence of breast cancer metastasis (BCM) 5, 10, or 15 years after the initial treatment. We developed prediction models using the deep feedforward neural network (DFNN) method, as well as models using nine other machine learning methods: naïve Bayes (NB), logistic regression (LR), support vector machine (SVM), LASSO, decision tree (DT), k-nearest neighbor (KNN), random forest (RF), AdaBoost (ADB), and XGBoost (XGB). We used grid search to tune hyperparameters for all methods. We then compared our feedforward deep learning models to the models trained using the nine other machine learning methods.
Results: Based on the mean test AUC (Area under the ROC Curve) results, DFNN ranks 6th, 4th, and 3rd when predicting 5-year, 10-year, and 15-year BCM, respectively, out of 10 methods. The top performing methods in predicting 5-year BCM are XGB (1st), RF (2nd), and KNN (3rd). For predicting 10-year BCM, the top performers are XGB (1st), RF (2nd), and NB (3rd). Finally, for 15-year BCM, the top performers are SVM (1st), LR and LASSO (tied for 2nd), and DFNN (3rd). The ensemble methods RF and XGB outperform other methods when data are less balanced, while SVM, LR, LASSO, and DFNN outperform other methods when data are more balanced. Our statistical testing results show that at a significance level of 0.05, DFNN overall performs comparably to other machine learning methods when predicting 5-year, 10-year, and 15-year BCM.
Conclusions: Our results show that deep learning with grid search overall performs at least as well as other machine learning methods when using non-image clinical data. Some of the other machine learning methods, such as XGB, RF, and SVM, are very strong competitors of DFNN when incorporating grid search. The computation time required for grid search with DFNN is also much greater than that required for grid search with the other nine machine learning methods.
Keywords: DNN; EHR; breast cancer; clinical; deep learning; machine learning; metastasis; metastatic breast cancer; non-image; prediction.
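The tuning procedure the abstract describes (try every hyperparameter combination, score each by AUC, keep the best) can be illustrated with a small sketch. This is not the authors' pipeline: it uses a toy k-nearest-neighbor scorer, synthetic imbalanced data, and cross-validated AUC, all of which are assumptions made for illustration. Every function name, the grid values, and the data generator are hypothetical.

```python
import itertools
import random

def auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one case of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def knn_scores(train_X, train_y, test_X, k, weighted):
    """Score each test row by the (optionally distance-weighted) share of positive neighbors."""
    out = []
    for x in test_X:
        nearest = sorted(
            (sum((a - b) ** 2 for a, b in zip(x, t)) ** 0.5, y)
            for t, y in zip(train_X, train_y)
        )[:k]
        w = [1.0 / (d + 1e-9) if weighted else 1.0 for d, _ in nearest]
        out.append(sum(wi * y for wi, (_, y) in zip(w, nearest)) / sum(w))
    return out

def stratified_folds(y, n_folds, rng):
    """Split row indices into folds that keep the class ratio roughly constant."""
    folds = [[] for _ in range(n_folds)]
    for cls in (0, 1):
        idx = [i for i, label in enumerate(y) if label == cls]
        rng.shuffle(idx)
        for j, i in enumerate(idx):
            folds[j % n_folds].append(i)
    return folds

def grid_search(X, y, grid, n_folds=5, seed=0):
    """Exhaustively evaluate every combination in `grid` by mean held-out AUC."""
    folds = stratified_folds(y, n_folds, random.Random(seed))
    keys = sorted(grid)
    results = []
    for values in itertools.product(*(grid[key] for key in keys)):
        params = dict(zip(keys, values))
        fold_aucs = []
        for held_out in folds:
            held = set(held_out)
            train = [i for i in range(len(y)) if i not in held]
            scores = knn_scores([X[i] for i in train], [y[i] for i in train],
                                [X[i] for i in held_out], **params)
            fold_aucs.append(auc([y[i] for i in held_out], scores))
        results.append((sum(fold_aucs) / n_folds, params))
    results.sort(key=lambda r: r[0], reverse=True)  # best mean AUC first
    return results

def make_synthetic(n=200, pos_rate=0.2, seed=1):
    """Synthetic two-feature data with a fixed minority of positive cases (not real patient data)."""
    rng = random.Random(seed)
    n_pos = int(n * pos_rate)
    y = [1] * n_pos + [0] * (n - n_pos)
    X = [[rng.gauss(1.0 * label, 1.0), rng.gauss(0.5 * label, 1.0)] for label in y]
    return X, y

X, y = make_synthetic()
grid = {"k": [3, 7, 15], "weighted": [False, True]}  # hypothetical grid: 3 x 2 = 6 settings
results = grid_search(X, y, grid)
best_auc, best_params = results[0]
print(best_params, round(best_auc, 3))
```

The number of model fits is the product of the grid sizes times the number of folds, which is why grid search becomes expensive for a method with many hyperparameters and slow training, consistent with the computation-time remark in the Conclusions.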
Conflict of interest statement
The authors declare no conflict of interest.
Similar articles
- Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review. Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7. PMID: 34025952. Free PMC article. Review.
- Machine learning and deep learning methods that use omics data for metastasis prediction. Comput Struct Biotechnol J. 2021 Sep 4;19:5008-5018. doi: 10.1016/j.csbj.2021.09.001. eCollection 2021. PMID: 34589181. Free PMC article. Review.
- Machine learning models predicting undertriage in telephone triage. Ann Med. 2022 Dec;54(1):2990-2997. doi: 10.1080/07853890.2022.2136402. PMID: 36286496. Free PMC article.
- Heterogeneous Ensemble Deep Learning Model for Enhanced Arabic Sentiment Analysis. Sensors (Basel). 2022 May 12;22(10):3707. doi: 10.3390/s22103707. PMID: 35632116. Free PMC article.
- Predicting efficacy of antiseizure medication treatment with machine learning algorithms in North Indian population. Epilepsy Res. 2024 Sep;205:107404. doi: 10.1016/j.eplepsyres.2024.107404. Epub 2024 Jul 1. PMID: 38996687.
Cited by
- Multi-Modal Fusion of Routine Care Electronic Health Records (EHR): A Scoping Review. Information (Basel). 2025 Jan;16(1):10.3390/info16010054. doi: 10.3390/info16010054. Epub 2025 Jan 15. PMID: 40843145. Free PMC article.
- Enhancing groundwater quality assessment in coastal area: A hybrid modeling approach. Heliyon. 2024 Jun 19;10(13):e33082. doi: 10.1016/j.heliyon.2024.e33082. eCollection 2024 Jul 15. PMID: 39027495. Free PMC article.
- Predicting no-shows at outpatient appointments in internal medicine using machine learning models. PeerJ Comput Sci. 2025 Apr 22;11:e2762. doi: 10.7717/peerj-cs.2762. eCollection 2025. PMID: 40567710. Free PMC article.
- Development and validation of a risk prediction model for kinesiophobia in postoperative lung cancer patients: an interpretable machine learning algorithm study. Sci Rep. 2025 Jun 3;15(1):19412. doi: 10.1038/s41598-025-03575-7. PMID: 40461518. Free PMC article.
- Deep Learning: A Heuristic Three-Stage Mechanism for Grid Searches to Optimize the Future Risk Prediction of Breast Cancer Metastasis Using EHR-Based Clinical Data. Cancers (Basel). 2025 Mar 25;17(7):1092. doi: 10.3390/cancers17071092. PMID: 40227603. Free PMC article.