Predicting the risk of colorectal cancer among diabetes patients using a random survival forest-guided approach
- PMID: 39403338
- PMCID: PMC11471444
- DOI: 10.3389/fonc.2024.1457446
Abstract
Background: Colorectal cancer (CRC) is the third most frequently diagnosed cancer worldwide. Diabetes and CRC share several lifestyle risk factors, including obesity, heavy alcohol use, and diet. This study aims to develop a risk scoring system for CRC prediction among diabetes patients using routine medical records.
Methods: A retrospective cohort study was conducted using electronic health records from Hong Kong. Patients who received diabetes care in public general outpatient clinics between 2010 and 2019 and had no history of cancer were identified and followed up until December 2019. The outcome was a diagnosis of CRC during follow-up. For model building, predictors were first selected using a random survival forest, and weights were then assigned to the selected predictors using Cox regression.
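To illustrate the second modelling step, the sketch below shows one common way to turn Cox regression coefficients into integer points for a scoring system: divide each coefficient by the smallest one and round. This is only an assumption about how weights might be assigned; the abstract does not state the exact rule. The coefficient values, the predictor names, and the age and waist-to-hip cut-offs implied by them are hypothetical. Only the creatinine threshold of 127 µmol/L comes from the abstract.

```python
# Hypothetical log-hazard-ratio coefficients for binary predictors.
# These are NOT values reported by the study; they only illustrate the arithmetic.
HYPOTHETICAL_COEFS = {
    "older_age": 0.90,                 # hypothetical age cut-off
    "high_waist_to_hip_ratio": 0.30,   # hypothetical cut-off
    "serum_creatinine_ge_127": 0.50,   # threshold taken from the abstract
}


def coefficients_to_points(coefs):
    """Scale each coefficient by the smallest absolute one, then round to an integer."""
    base = min(abs(beta) for beta in coefs.values())
    return {name: round(beta / base) for name, beta in coefs.items()}


def risk_score(patient, points):
    """Sum the points of every predictor that is flagged True for this patient."""
    return sum(pts for name, pts in points.items() if patient.get(name, False))


points = coefficients_to_points(HYPOTHETICAL_COEFS)

# A hypothetical patient who is older and has elevated serum creatinine.
patient = {"older_age": True, "serum_creatinine_ge_127": True}
total = risk_score(patient, points)
print(points, total)
```

Rounding to integers trades a little accuracy for a score that can be added up by hand, which is the usual motivation for point-based systems built on routine records.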
Results: Of the 386,325 patients identified, 4,199 developed CRC during a median follow-up of 6.2 years. The overall incidence rate of CRC was 1.93 per 1,000 person-years. The final scoring system included age, waist-to-hip ratio, and serum creatinine as predictors. The C-index on the test set was 0.651 (95% CI: 0.631-0.669). Elevated serum creatinine (≥127 µmol/L) may be an important predictor of increased CRC risk.
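For readers unfamiliar with the C-index, the following is a minimal sketch of Harrell's concordance index, a widely used estimator for censored survival data. The abstract does not say which estimator the authors used, so treating it as Harrell's version is an assumption. The function name and the toy data are hypothetical, and tied event times are simply skipped for brevity.

```python
def harrell_c_index(times, events, scores):
    """Fraction of comparable patient pairs in which the higher risk score
    belongs to the patient who had the event earlier.

    times  : follow-up time for each patient
    events : 1 if the outcome was observed, 0 if censored
    scores : predicted risk score (higher means higher predicted risk)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a pair is anchored on a patient with an observed event
        for j in range(n):
            # Comparable only if patient i's event precedes patient j's
            # event or censoring time (tied times are ignored here).
            if times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0
                elif scores[i] == scores[j]:
                    concordant += 0.5  # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data, not taken from the study.
toy_times = [2, 4, 6, 8, 10]
toy_events = [1, 1, 0, 1, 0]
toy_scores = [5, 3, 4, 1, 2]
c_index = harrell_c_index(toy_times, toy_events, toy_scores)
print(c_index)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, so the reported 0.651 indicates modest discrimination.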
Conclusion: While obesity is a well-known risk factor for CRC, renal dysfunction may also be linked to an elevated risk of CRC among diabetes patients. Further studies are warranted to explore whether renal function could help guide screening recommendations for diabetes patients.
Keywords: colorectal cancer; diabetes; random forest; risk prediction; survival analysis.
Copyright © 2024 Yau, Hung, Leung, Chong, Lee and Yeoh.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.