Development of Machine Learning Systems to Predict Cancer-Related Symptoms With Validation Across a Health Care System
- PMID: 40997295
- PMCID: PMC12487662
- DOI: 10.1200/CCI-25-00073
Development of Machine Learning Systems to Predict Cancer-Related Symptoms With Validation Across a Health Care System
Abstract
Purpose: Cancer and its treatment cause symptoms. In this study, we aimed to develop machine learning (ML) systems that predict future symptom deterioration among people receiving treatment for cancer and then validate the systems in a simulated deployment across an entire health care system.
Methods: We trained and tested ML systems that predict a deterioration in nine patient-reported symptoms within 30 days after treatments for aerodigestive cancers, using internal electronic health record (EHR) data at Princess Margaret Cancer Centre (3,229 patients; 20,267 treatments). The primary performance metric was the area under the receiver operating characteristic curve (AUROC). The best-performing systems in the held-out internal test set were then externally validated across 82 cancer centers in Ontario (12,079 patients; 77,003 treatments) by adapting techniques from meta-analysis.
Results: The best ML systems predicted symptom deterioration with AUROCs ranging from 0.66 (95% CI, 0.63 to 0.69) for dyspnea to 0.73 (95% CI, 0.71 to 0.75) for drowsiness in the internal test cohort. Treatments flagged as high-risk were significantly associated with future symptom deterioration (odds ratios [ORs], 2.53-6.56; all P < .001) and emergency department visits for dyspnea (OR, 1.85; P = .008), depression (OR, 1.84; P = .04), and anxiety (OR, 2.66; P < .001). In the external validation cohort, the AUROCs for different symptoms meta-analyzed across centers ranged from 0.67 (95% CI, 0.66 to 0.68) to 0.73 (95% CI, 0.72 to 0.74). Performance across centers displayed significant heterogeneity for six of nine symptoms (I2, 46.4%-66.9%; P = .004 for dyspnea, P < .001 for the rest).
Conclusion: ML can predict future symptoms among people with cancer from routine EHR data, which could guide personalized interventions. Heterogeneous performance across centers must be considered when systems are deployed across a health care system.
Conflict of interest statement
The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated unless otherwise noted. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO's conflict of interest policy, please refer to
Open Payments is a public database containing information reported by companies about payments made to US-licensed physicians (
No other potential conflicts of interest were reported.
Figures
References
-
- Cherny NI, Sullivan R, Dafni U, et al. : A standardised, generic, validated approach to stratify the magnitude of clinical benefit that can be anticipated from anti-cancer therapies: The European Society for Medical Oncology Magnitude of Clinical Benefit Scale (ESMO-MCBS). Ann Oncol 26:1547-1573, 2015 - PubMed
-
- Aaronson NK, Ahmedzai S, Bergman B, et al. : The European Organization for Research and Treatment of Cancer QLQ-C30: A quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst 85:365-376, 1993 - PubMed
-
- Cella DF, Tulsky DS, Gray G, et al. : The Functional Assessment of Cancer Therapy scale: Development and validation of the general measure. J Clin Oncol 11:570-579, 1993 - PubMed
-
- Bubis LD, Davis L, Mahar A, et al. : Symptom burden in the first year after cancer diagnosis: An analysis of patient-reported outcomes. J Clin Oncol 36:1103-1111, 2018 - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
