Web-based explainable machine-learning tool for predicting five-year recurrence of colorectal cancer after curative resection: multicentre retrospective cohort study
- PMID: 41186849
- DOI: 10.1007/s12672-025-03840-1
Web-based explainable machine-learning tool for predicting five-year recurrence of colorectal cancer after curative resection: multicentre retrospective cohort study
Abstract
Objective: The World Health Organization identifies colorectal cancer as the third-most diagnosed malignancy and second leading cause of cancer-related mortality worldwide. Up to 30% of patients relapse within 5 years postoperatively; however, conventional staging methods cannot reliably stratify individual risks, underscoring the need for precise, patient-centred decision-support tools.
Methods: We retrospectively analysed data on 1,789 colorectal cancer patients undergoing curative resection (2013-2023) from the Tri-Service General Hospital registry. Four tree-based machine learning algorithms were trained on demographic, tumour, immunohistochemical, and laboratory features. Patients were divided chronologically into training (January to September) and validation (October to December) sets. Feature importance was assessed using random forest impurity scores and Shapley additive explanations (SHAP). A web-based artificial intelligence-clinical decision support system (AI-CDSS) was developed to provide real-time, scenario-specific five-year recurrence-risk estimates.
Results: Of the 1,789 patients, 406 (22.7%) experienced recurrence. The top 10 predictors accounted for approximately 43% of the total model importance. SHAP analysis confirmed that tumour burden, biological markers, treatment intensity, and host factors were key drivers of recurrence risk. On the validation set, all models achieved area under the receiver operating characteristic curve values of 0.83-0.84. The random forest-based system demonstrated 87% accuracy, 85% positive predictive value, 87% negative predictive value, and an F1 score of 0.64, and was consequently selected as the AI-CDSS engine.
Conclusion: The proposed AI-CDSS delivers personalised five-year recurrence-risk estimates for patients with colorectal cancer within 1 s via an intuitive web interface, facilitating evidence-based, patient-centred treatment decisions grounded in local population data.
Keywords: Artificial intelligence; Clinical decision-support system; Colorectal cancer; Machine learning; Recurrence prediction.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethical approval and consent to participate: This study was approved by the Institutional Review Board of Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan (approval number A202305086) and conducted following the 1964 Declaration of Helsinki and its later amendments. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.
References
-
- Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424. - PubMed
-
- Collaborators G. .C. The global, regional, and National burden of colorectal cancer and its attributable risk factors in 195 countries and territories, 1990–2017: a systematic analysis for the global burden of disease study 2017. Lancet Gastroenterol Hepatol. 2019;4:913–33. - DOI
-
- Ryu HS et al. Recurrence patterns and risk factors after curative resection for colorectal cancer: insights for postoperative surveillance strategies. Cancers (Basel) 15(2023).