Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Dec 27;17(1):175.
doi: 10.1007/s12672-025-04275-4.

Explainable machine learning predicts overall survival in female bladder cancer patients after radical cystectomy

Affiliations

Explainable machine learning predicts overall survival in female bladder cancer patients after radical cystectomy

Ming Yan Zhong et al. Discov Oncol. .

Abstract

Background and purpose: Despite the higher incidence of bladder cancer in males, females face a disproportionately worse prognosis with more advanced disease. Accordingly, accurate survival prediction is crucial. This study sought to develop multiple machine learning models for predicting postoperative overall survival (OS) in female bladder cancer patients who have undergone radical cystectomy (RC).

Patients and methods: A retrospective analysis was conducted on female patients who underwent RC with postoperative pathological confirmation of bladder cancer in the SEER database from 2004 to 2022. These patients were randomly divided into a training set and an internal validation set at a 7:3 ratio. Additionally, 67 female bladder cancer patients from the Second Affiliated Hospital of Nanchang University were included as an external validation set. LASSO-Cox regression and Cox regression were used to identify independent prognostic factors for bladder cancer. Based on these factors, prediction models were constructed using five machine learning algorithms: Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Gradient Boosting Decision Tree (GBDT), Random Forest (RF), and XGBoost. The predictive performance of the models was evaluated using the area under the receiver operating characteristic curve (AUC), accuracy, precision, F1 score, concordance index (C-index), calibration curves, and decision curve analysis. The SHapley Additive exPlanations (SHAP) method was employed to interpret the most impactful features of the best-performing model.

Results: The study included a total of 4,603 patients. Lasso-Cox regression screening and Cox regression analysis revealed that T stage, N stage, age, tumor size, marital status, chemotherapy, and the number of examined lymph nodes (ELN) during surgery were significantly associated with the OS of female bladder cancer patients. Compared with other models, GBDT demonstrated superior discriminative ability in predicting 1-year, 3-year, and 5-year survival rates (1-year AUC(95%CI) = 0.771(0.742-0.801), 3-year AUC(95%CI) = 0.757(0.730-0.783), 5-year AUC(95%CI) = 0.745(0.721-0.770)), along with higher prediction accuracy, precision, C-index and F1 score. The calibration curve and decision curve analysis (DCA) confirmed the excellent predictive accuracy and clinical benefits of the GBDT model. These results were also validated in the external validation cohort. T stage, N stage, and chemotherapy were the most significant features of the optimal model, and SHAP analysis identified their important contributions within the model.

Conclusion: We have developed interpretable machine learning models to predict the OS of female bladder cancer patients following radical surgery. This model is intended to assist in clinical prognosis evaluation and provide a reference for individualized treatment decision-making.

Keywords: Female bladder cancer; Machine learning; Overall survival; Radical cystectomy; SEER.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study and its associated projects were approved by the Biomedical Research Ethics Committee of the Ethics Committee of the Second Affiliated Hospital of Nanchang University, and the requirement for informed consent was waived by the same Ethics Committee. All experiments were performed in accordance with relevant guidelines and regulations. Consent for publication: All authors have read and agreed to submit this manuscript for publication. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Research workflow diagram
Fig. 2
Fig. 2
(A) Coefficient profiles of clinical and pathological features in the LASSO regression. (B) Feature selection using LASSO regression with 10-fold cross-validation based on the minimum criterion. The penalty parameter (λ) was applied to shrink coefficients toward zero for automatic feature selection. (C) LASSO regression coefficients of the key selected features
Fig. 3
Fig. 3
ROC curves of SVM, KNN, GBDT, RF, and XGBoost models in the internal validation cohort (A, B, C) and external validation cohort (D, E, F)
Fig. 4
Fig. 4
Calibration curves of SVM, KNN, GBDT, RF, and XGBoost models in the internal (A, B, C) and external (D, E, F) validation cohorts, and decision curve analysis (DCA) of these models in the internal (G, H, I) and external (J, K, L) validation cohorts
Fig. 5
Fig. 5
Interpretability of the GBDT model evaluated using the SHAP method. (A) The SHAP bar plot displays feature importance based on mean absolute SHAP values. (B) The SHAP summary plot (beeswarm plot) illustrates the distribution of the impact of each feature on model predictions. Each point represents an individual patient, with colors indicating the feature value (red for high, blue for low). (C) The waterfall plot depicts the cumulative contribution path of features for a specific instance, starting from the base value to the model output

References

    1. Lobo N, Afferi L, Moschini M, et al. Epidemiology, screening, and prevention of bladder cancer. Eur Urol Oncol. 2022;5(6):628–39. 10.1016/j.euo.2022.10.003. - DOI - PubMed
    1. van Hoogstraten LMC, Vrieling A, van der Heijden AG, et al. Global trends in the epidemiology of bladder cancer: challenges for public health and clinical practice. Nat Rev Clin Oncol. 2023;20(5):287–304. 10.1038/s41571-023-00744-3. - DOI - PubMed
    1. Tian YQ, Yang JC, Hu JJ, et al. Trends and risk factors of global incidence, mortality, and disability of genitourinary cancers from 1990 to 2019: systematic analysis for the global burden of disease study 2019. Front Public Health. 2023;11:1119374. 10.3389/fpubh.2023.1119374. - DOI - PMC - PubMed
    1. Hyldgaard JM, Jensen JB. The inequality of females in bladder cancer. APMIS. 2021;129(12):694–9. 10.1111/apm.13183. - DOI - PubMed
    1. Goto T, Miyamoto H. The role of estrogen receptors in urothelial cancer. Front Endocrinol (Lausanne). 2021;12:643870. 10.3389/fendo.2021.643870. - DOI - PMC - PubMed

LinkOut - more resources