Development and validation of hybrid machine learning approach for predicting survival in patients with cervical cancer: a SEER-based population study
- PMID: 40606977
- PMCID: PMC12213391
- DOI: 10.3389/fonc.2025.1605378
Development and validation of hybrid machine learning approach for predicting survival in patients with cervical cancer: a SEER-based population study
Abstract
Background: Accurate survival prediction in cervical cancer is crucial for personalized therapy, particularly in high-risk groups where early intervention might enhance results. The study aims to create a hybrid survival model that integrates Cox Proportional Hazards (CoxPH) with Elastic Net regularization and Random Survival Forest (RSF) to improve prediction accuracy and interpretability.
Methods: Data from the SEER database (2013-2015) were pre-processed through normalization and encoding. RSF recorded non-linear interactions between covariates, while the CoxPH Elastic Net Regularization model provided linear interpretability and identified key variables. Model parameters were optimized using cross-validation, and final performance was assessed on an independent test set using metrics including C-index, Integrated Brier Score (IBS), AUC-ROC, and calibration plots.
Results: The hybrid model outperformed the individual models with an Integrated Brier Score (IBS) of 0.13 and a concordance index (C-index) of 0.82. With an AUC-ROC of 0.84, the model provided robust calibration and classification performance on the independent test set, effectively separating between individuals at high and low risk.
Conclusion: The hybrid model provides a promising tool for personalized risk stratification in cervical cancer based on survival probability. Further testing in varied clinical categories is recommended to confirm its efficiency in precision oncology.
Keywords: SEER database; cervical cancer; hybrid models; machine learning; survival models.
Copyright © 2025 Kolasseri and B.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures






Similar articles
-
Comparison of Random Survival Forest Based-Overall Survival With Deep Learning and Cox Proportional Hazard Models in HER-2-Positive HR-Negative Breast Cancer.Cancer Rep (Hoboken). 2025 Jul;8(7):e70262. doi: 10.1002/cnr2.70262. Cancer Rep (Hoboken). 2025. PMID: 40624807 Free PMC article.
-
OncoE25: an AI model for predicting postoperative prognosis in early-onset stage I-III colon and rectal cancer-a population-based study using SEER with dual-center cohort validation.J Transl Med. 2025 Jun 22;23(1):695. doi: 10.1186/s12967-025-06663-4. J Transl Med. 2025. PMID: 40545536 Free PMC article.
-
Development and validation of a Log odds of negative lymph nodes/T stage ratio-based prognostic model for gastric cancer.Front Oncol. 2025 Jun 3;15:1554270. doi: 10.3389/fonc.2025.1554270. eCollection 2025. Front Oncol. 2025. PMID: 40530015 Free PMC article.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
References
-
- Sun P, Yu C, Yin L, Chen Y, Sun Z, Zhang T, et al. Global, regional, and national burden of female cancers in women of child-bearing age, 1990–2021: analysis of data from the global burden of disease study 2021. EClinicalMedicine. (2024) 74:102713. doi: 10.1016/j.eclinm.2024.102713 - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources