Clinical significance of risk factor analysis in pancreatic cancer by using supervised model of machine learning
- PMID: 40491770
- PMCID: PMC12146380
- DOI: 10.3389/fmed.2025.1551926
Clinical significance of risk factor analysis in pancreatic cancer by using supervised model of machine learning
Abstract
Introduction: Pancreatic cancer (PC) poses a significant global health challenge due to its aggressive nature, late-stage diagnosis, and high mortality despite advancements in treatment. Early detection remains crucial for timely intervention. This study aimed to identify clinically relevant predictors of pancreatic cancer using a supervised machine learning approach and to develop a risk stratification tool with diagnostic capabilities.
Methods: A matched case-control study was conducted retrospectively at the Tenth People's Hospital of Tongji University (2017-2023), involving 353 cases and 370 matched controls. Demographic and hematological data were extracted from medical records. Variables were pre-selected using cluster dendrograms and subsequently refined using logistic regression with backward elimination and Support Vector Machine (SVM) models. A final risk scoring model was developed based on the best-performing model and internally validated.
Results: Key predictors retained in the final logistic regression model included Hemoglobin A1c (HbA1c) (OR 1.28; 95% CI: 1.08-1.52), Alkaline Phosphatase (ALP) (OR 1.02; 95% CI: 1.01-1.03), CA19-9 (OR 1.01; 95% CI: 1.01-1.01), Carcinoembryonic Antigen (CEA) (OR 1.41; 95% CI: 1.20-1.66), and Body Mass Index (BMI) (OR 0.88; 95% CI: 0.81-0.97). The final model demonstrated excellent diagnostic performance (AUC = 0.969, p < 0.001), with high accuracy, sensitivity, and specificity. A nomogram was constructed to facilitate individualized PC risk assessment.
Conclusion: HbA1c, ALP, CA19-9, CEA, and BMI were independently associated with pancreatic cancer. The machine learning-derived risk scoring model demonstrated high predictive accuracy and may serve as a valuable clinical tool for early detection and screening of pancreatic cancer.
Keywords: machine learning; pancreatic cancer; risk factors; risk scoring; supervised model.
Copyright © 2025 Sherchan, Jin, Sherchan, Mandal, Upadhaya Regmi, Ghising, Upadhaya, Gautam, Pathak and Li.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures






References
-
- Wild C, Stewart B, Wild C. World Cancer Report 2014. Geneva: World Health Organization; (2014).
-
- World Health Organization. World Health Organization Statistical Information System. WHo Mortality Database. Geneva: World Health Organization; (2012).
LinkOut - more resources
Full Text Sources