JCO Clin Cancer Inform. 2025 Sep:9:e2500073.
doi: 10.1200/CCI-25-00073. Epub 2025 Sep 25.

Development of Machine Learning Systems to Predict Cancer-Related Symptoms With Validation Across a Health Care System

Baijiang Yuan et al. JCO Clin Cancer Inform. 2025 Sep.

Abstract

Purpose: Cancer and its treatment cause symptoms. In this study, we aimed to develop machine learning (ML) systems that predict future symptom deterioration among people receiving treatment for cancer and then validate the systems in a simulated deployment across an entire health care system.

Methods: We trained and tested ML systems that predict a deterioration in nine patient-reported symptoms within 30 days after treatments for aerodigestive cancers, using internal electronic health record (EHR) data at Princess Margaret Cancer Centre (3,229 patients; 20,267 treatments). The primary performance metric was the area under the receiver operating characteristic curve (AUROC). The best-performing systems in the held-out internal test set were then externally validated across 82 cancer centers in Ontario (12,079 patients; 77,003 treatments) by adapting techniques from meta-analysis.
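As an illustration of the primary metric, the AUROC can be read as the probability that a randomly chosen treatment followed by symptom deterioration receives a higher predicted risk than a randomly chosen treatment that was not. The following is a minimal sketch of that pairwise definition, not the authors' evaluation code; the function name `auroc` and the toy labels and scores are assumptions made for illustration.

```python
def auroc(labels, scores):
    """Pairwise AUROC: fraction of (positive, negative) pairs in which the
    positive case gets the higher score; tied pairs count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = symptom deteriorated within the window, 0 = did not.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.10, 0.40, 0.35, 0.80]
toy_auroc = auroc(toy_labels, toy_scores)
```

The quadratic loop is chosen for readability; for cohorts of the size reported here, a rank-based implementation would be the practical choice.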

Results: The best ML systems predicted symptom deterioration with AUROCs ranging from 0.66 (95% CI, 0.63 to 0.69) for dyspnea to 0.73 (95% CI, 0.71 to 0.75) for drowsiness in the internal test cohort. Treatments flagged as high-risk were significantly associated with future symptom deterioration (odds ratios [ORs], 2.53-6.56; all P < .001) and emergency department visits for dyspnea (OR, 1.85; P = .008), depression (OR, 1.84; P = .04), and anxiety (OR, 2.66; P < .001). In the external validation cohort, the AUROCs for different symptoms meta-analyzed across centers ranged from 0.67 (95% CI, 0.66 to 0.68) to 0.73 (95% CI, 0.72 to 0.74). Performance across centers displayed significant heterogeneity for six of nine symptoms (I², 46.4%-66.9%; P = .004 for dyspnea, P < .001 for the rest).
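The abstract does not state which pooling model was used for the external validation, so the following is only a sketch of one standard approach: fixed-effect inverse-variance pooling of per-center estimates, with Cochran's Q and the I² statistic as the heterogeneity measure. The function name `pool_and_heterogeneity` and the two-center inputs are hypothetical and are not values from the study.

```python
import math


def pool_and_heterogeneity(estimates, std_errors):
    """Inverse-variance pooled estimate plus Cochran's Q and I² (percent).

    estimates:  per-center point estimates (for example, AUROCs)
    std_errors: per-center standard errors of those estimates
    """
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, estimates))
    df = len(estimates) - 1
    # I²: share of total variation attributed to between-center differences.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    return pooled, pooled_se, q, i2


# Hypothetical inputs: two centers with equal precision but different AUROCs.
pooled, pooled_se, q, i2 = pool_and_heterogeneity([0.60, 0.80], [0.05, 0.05])
```

An I² near 0% suggests that differences between centers are compatible with sampling error alone, while larger values, such as the 46.4% to 66.9% range reported above, indicate that a substantial share of the variation reflects real differences between centers.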

Conclusion: ML can predict future symptoms among people with cancer from routine EHR data, which could guide personalized interventions. Heterogeneous performance across centers must be considered when systems are deployed across a health care system.
