Leveraging survival analysis and machine learning for accurate prediction of breast cancer recurrence and metastasis
- PMID: 39880868
- PMCID: PMC11779859
- DOI: 10.1038/s41598-025-87622-3
Leveraging survival analysis and machine learning for accurate prediction of breast cancer recurrence and metastasis
Abstract
Breast cancer, with its high incidence and mortality globally, necessitates early prediction of local and distant recurrence to improve treatment outcomes. This study develops and validates predictive models for breast cancer recurrence and metastasis using Recurrence-Free Survival Analysis and machine learning techniques. We merged datasets from the Molecular Taxonomy of Breast Cancer International Consortium, Memorial Sloan Kettering Cancer Center, Duke University, and the SEER program, creating a comprehensive dataset of 272, 252 rows and 23 columns. Our methodology utilized three predictive strategies: assessing recurrence risk, differentiating local from distant recurrences, and identifying potential metastatic sites. Key prognostic factors were identified through survival analysis. LightGBM, XGBoost, and Random Forest models were employed and validated against data from the Baheya Foundation. The models demonstrated strong performance; the survival analysis achieved a C-index of 0.837. The LightGBM model reached an AUC of 92% in predicting recurrences, while XGBoost and Random Forest models distinguished recurrence types with up to 86% accuracy, and they effectively differentiated between bone metastasis and all other locations combined (brain, liver, and lungs). This study highlights the significant potential of machine learning in advancing breast cancer management and sets a new benchmark for predictive analytics. Future research will integrate genetic data to further enhance these models.
Keywords: Breast cancer; Machine learning; Metastasis; Recurrence prediction; Survival analysis.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests.
Figures
References
-
- Global Cancer Observatory, International Agency for Research on Cancer. Global cancer observatory. https://gco.iarc.fr/en.
-
- World Health Organization. Breast cancer. World Health Organization. https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
-
- Abdelaziz, A. H. et al. Breast cancer awareness among egyptian women and the impact of caring for patients with breast cancer on family caregivers’ knowledge and behaviour. Res. Oncol.17, 1–8 (2021).
-
- Baheya Foundation. Baheya Foundation. https://baheya.org/en.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
