Developing machine learning models for personalized treatment strategies in early breast cancer patients undergoing neoadjuvant systemic therapy based on SEER database
- PMID: 39333608
- PMCID: PMC11436944
- DOI: 10.1038/s41598-024-72385-0
Developing machine learning models for personalized treatment strategies in early breast cancer patients undergoing neoadjuvant systemic therapy based on SEER database
Abstract
This study aimed to compare the long-term outcomes of breast-conserving surgery plus radiotherapy (BCS + RT) and mastectomy in early breast cancer (EBC) patients who received neoadjuvant systemic therapy (NST), and sought to construct and authenticate a machine learning algorithm that could assist healthcare professionals in formulating personalized treatment strategies for this patient population. We analyzed data from the Surveillance, Epidemiology, and End Results database on EBC patients undergoing BCS + RT or mastectomy post-NST (2010-2018). Employing propensity score matching (PSM) to minimize potential biases, we compared breast cancer-specific survival (BCSS) and overall survival (OS) between the two surgical groups. Additionally, we trained and validated six machine learning survival models and developed a cloud-based recommendation system for surgical treatment based on the optimal model. Among the 13,958 patients, 9028 (64.7%) underwent BCS + RT and 4930 (35.3%) underwent mastectomy. After PSM, there were 3715 patients in each group. Compared to mastectomy, BCS + RT significantly improved BCSS (p < 0.001) and OS (p < 0.001). Prognostic variables associated with BCSS were utilized to develop machine learning models. In both the training and validation cohorts, the random survival forest (RSF) model demonstrated superior predictive performance (0.847 and 0.795), not only outperforming other machine learning models, including Rpart (0.725 and 0.707), Xgboost (0.762 and 0.727), Glmboost (0.748 and 0.788), Survctree (0.764 and 0.766), and Survsvm (0.777 and 0.790), but also outperforming the classical COX model (0.749 and 0.782). Lastly, a web-based prediction tool was built to facilitate clinical application [ https://jhren.shinyapps.io/shinyapp1 ]. After adjusting other confounders, BCS + RT was associated with improved outcomes in patients with EBC after NST, compared to those who underwent mastectomy. Moreover, the RSF model, a reliable tool, can predict long-term outcomes for patients, providing valuable guidance for operative methods and postoperative follow-up.
Keywords: Breast-conserving surgery; Early breast cancer; Long-term outcomes; Machine learning; Mastectomy; Neoadjuvant systemic therapy.
© 2024. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures






Similar articles
-
Survival Comparisons between Breast Conservation Surgery and Mastectomy Followed by Postoperative Radiotherapy in Stage I-III Breast Cancer Patients: Analysis of the Surveillance, Epidemiology, and End Results (Seer) Program Database.Curr Oncol. 2022 Aug 15;29(8):5731-5747. doi: 10.3390/curroncol29080452. Curr Oncol. 2022. PMID: 36005190 Free PMC article.
-
Development and validation of novel machine learning-based prognostic models and propensity score matching for comparison of surgical approaches in mucinous breast cancer.Front Endocrinol (Lausanne). 2025 Jun 3;16:1557858. doi: 10.3389/fendo.2025.1557858. eCollection 2025. Front Endocrinol (Lausanne). 2025. PMID: 40529827 Free PMC article.
-
The different outcomes between breast-conserving surgery plus radiotherapy and mastectomy in metaplastic breast cancer: A population-based study.PLoS One. 2021 Sep 2;16(9):e0256893. doi: 10.1371/journal.pone.0256893. eCollection 2021. PLoS One. 2021. PMID: 34473783 Free PMC article.
-
Advancing personalized oncology: a systematic review on the integration of artificial intelligence in monitoring neoadjuvant treatment for breast cancer patients.BMC Cancer. 2024 Oct 21;24(1):1300. doi: 10.1186/s12885-024-13049-0. BMC Cancer. 2024. PMID: 39434042 Free PMC article.
-
Machine learning applications in breast cancer survival and therapeutic outcome prediction based on multi-omic analysis.Yi Chuan. 2024 Oct;46(10):820-832. doi: 10.16288/j.yczz.24-156. Yi Chuan. 2024. PMID: 39443311 Review.
Cited by
-
Comparison of Random Survival Forest Based-Overall Survival With Deep Learning and Cox Proportional Hazard Models in HER-2-Positive HR-Negative Breast Cancer.Cancer Rep (Hoboken). 2025 Jul;8(7):e70262. doi: 10.1002/cnr2.70262. Cancer Rep (Hoboken). 2025. PMID: 40624807 Free PMC article.
References
-
- Siegel, R. L., Miller, K. D., Wagle, N. S. & Jemal, A. Cancer statistics, 2023. CA Cancer J. Clin.73(1), 17–48 (2023). - PubMed
-
- Loibl, S., Poortmans, P., Morrow, M., Denkert, C. & Curigliano, G. Breast cancer. Lancet.397(10286), 1750–1769 (2021). - PubMed
-
- Mieog, J. S., van der Hage, J. A. & van de Velde, C. J. Neoadjuvant chemotherapy for operable breast cancer. Br. J. Surg.94(10), 1189–1200 (2007). - PubMed
-
- Wolmark, N., Wang, J., Mamounas, E., Bryant, J. & Fisher, B. Preoperative chemotherapy in patients with operable breast cancer: Nine-year results from National Surgical Adjuvant Breast and Bowel Project B-18. J. Natl. Cancer Inst. Monogr.30, 96–102 (2001). - PubMed
-
- Wrubel, E., Natwick, R. & Wright, G. P. Breast-conserving therapy is associated with improved survival compared with mastectomy for early-stage breast cancer: A propensity score matched comparison using the national cancer database. Ann. Surg. Oncol.28(2), 914–919 (2021). - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous