Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 26;25(1):940.
doi: 10.1186/s12885-025-14369-5.

Prediction of one-year recurrence among breast cancer patients undergone surgery using artificial intelligence-based algorithms: a retrospective study on prognostic factors

Affiliations

Prediction of one-year recurrence among breast cancer patients undergone surgery using artificial intelligence-based algorithms: a retrospective study on prognostic factors

Raoof Nopour. BMC Cancer. .

Abstract

Background and aim: Breast cancer is highly prevalent, with an increasing trend in women globally. Although the survival of breast cancer is relatively high, the recurrence rate is also high, demanding effective predictive solutions to breast cancer prognosis among post-operative patients. So far, Artificial intelligence algorithms integrated with various clinical data have demonstrated potential predictive capability regarding breast cancer recurrence.

Objective: This study aims to specifically conduct a predictive analysis of one-year recurrence of breast cancer by comparing and analyzing different machine learning and deep learning algorithms trained by structural prognostic data.

Materials and methods: This retrospective study was carried out using one database, including 1156 post-operative breast cancer data from 30 January 2020 to 30 December 2022, in three clinical centers in Tehran City. The inclusion criteria were patients who had undergone at least one surgery, had at least one year of medical records, and did not have other conditions. The patients who were diagnosed with malignant BC and had undergone adjuvant therapies without surgery were excluded from the study. Twenty-three prognostic factors were utilized to train algorithms to establish prediction models for the one-year recurrence of breast cancer. The data were analyzed using univariate and adjusted correlation-based methods and chosen machine learning and deep learning algorithms. The discrimination, calibration, and clinical utility were leveraged to assess the algorithms' performance efficiency. The SHapley Additive exPlanations plot was generated to identify the prominent prognostic factors affecting the one-year recurrence of breast cancer.

Results: Totally, 445 relapsed and 711 non-relapsed cases were utilized in this study. Our empirical study showed that the random forest with a positive predictive value of 0.96, negative predictive value of 0.92, sensitivity of 0.92, specificity of 0.96, accuracy of 0.94, F-score of 0.94, area under the receiver operator characteristics curve of 0.919 was the best-performing model for predicting the breast cancer recurrence. As the analysis of SHapley Additive exPlanations indicated, the tumor grade, HER-2, and the number of lymph nodes involved were more significant predictors.

Conclusion: The current study demonstrated the potential predictive power of the random forest for early predicting tumors among breast cancer patients who have undergone surgery and its utility in enhancing decision-making in clinical environments. It is crucial in promoting the prognosis, more effectively choosing therapies, augmenting post-operative breast cancer patients' survival, and controlling the limited healthcare resources.

Clinical trial number: Not applicable.

Keywords: Artificial intelligence; Breast cancer; Clinical utility; Prognosis; Tumor recurrence.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study was approved by the ethics committee of Tehran University of Medical Sciences (IR.TUMS.SPH.REC.1398.401). The study was performed in compliance with this institutional guideline, ethical guidelines for clinical research of the Iranian government, and the Declaration of Helsinki. The database leveraged was anonymous (did not include names or any identification detail), and confidentiality of information was assured. All methods were performed in accordance with the relevant guidelines and regulations by the Declaration of Helsinki. Informed consent was obtained from all subjects and/or their legal guardian(s). Consent for publication: Not applicable. Competing interests: The author declares no competing interests.

Figures

Fig. 1
Fig. 1
The flowchart of the sample selection process
Fig. 2
Fig. 2
The ROC curve plots for ML models
Fig. 3
Fig. 3
The ROC curve plots for DL models
Fig. 4
Fig. 4
Calibration plots for ML models
Fig. 5
Fig. 5
Calibration plots for DL models
Fig. 6
Fig. 6
The DCA of AI-based models. The NB curves for the chosen prognostic factors are indicated. The X and Y axes indicate the risk thresholds for predicting the one-year BC recurrence and NB, respectively
Fig. 7
Fig. 7
The SHAP summary plot for one-year BC recurrence. It put together feature importance and feature influence. Each point on the summary plot illustrates the Shapley value associated with one instance. The positions on the y and x axes represent the feature and Shapley values, respectively. The blue and red colors show the value of the individual features from low to high, respectively

Similar articles

Cited by

References

    1. Smolarz B, Nowak AZ, Romanowicz H. Breast cancer—epidemiology, classification, pathogenesis and treatment (Review of Literature). Cancers [Internet]. 2022; 14(10). - PMC - PubMed
    1. Łukasiewicz S, Czeczelewski M, Forma A, Baj J, Sitarz R, Stanisławek A, Breast, Cancer—Epidemiology. Risk factors, classification, prognostic markers, and current treatment Strategies—An. Updated Rev Cancers [Internet]. 2021; 13(17). - PMC - PubMed
    1. Huang J, Chan PS, Lok V, Chen X, Ding H, Jin Y, et al. Global incidence and mortality of breast cancer: a trend analysis. Aging. 2021;13(4):5748–803. - PMC - PubMed
    1. Lei S, Zheng R, Zhang S, Wang S, Chen R, Sun K, et al. Global patterns of breast cancer incidence and mortality: A population-based cancer registry data analysis from 2000 to 2020. Cancer Commun. 2021;41(11):1183–94. - PMC - PubMed
    1. Roheel A, Khan A, Anwar F, Akbar Z, Akhtar MF, Imran Khan M et al. Global epidemiology of breast cancer based on risk factors: a systematic review. Front Oncol. 2023;13. - PMC - PubMed