Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
- PMID: 24643167
- PMCID: PMC3963101
- DOI: 10.1136/bmjopen-2013-004007
Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
Abstract
Objectives: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes.
Setting: A regional cancer centre in Australia.
Participants: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data.
Primary and secondary outcome measures: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC).
Results: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours.
Conclusions: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems.
Keywords: Cancer; Electronic Medical Record; Machine Learning; Prediction; Survival.
Similar articles
-
Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.Acad Emerg Med. 2016 Mar;23(3):269-78. doi: 10.1111/acem.12876. Epub 2016 Feb 13. Acad Emerg Med. 2016. PMID: 26679719 Free PMC article.
-
Performance of a Machine Learning Algorithm Using Electronic Health Record Data to Identify and Estimate Survival in a Longitudinal Cohort of Patients With Lung Cancer.JAMA Netw Open. 2021 Jul 1;4(7):e2114723. doi: 10.1001/jamanetworkopen.2021.14723. JAMA Netw Open. 2021. PMID: 34232304 Free PMC article.
-
Development and Validation of an Electronic Health Record-Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment.JAMA Netw Open. 2018 Aug 3;1(4):e181018. doi: 10.1001/jamanetworkopen.2018.1018. JAMA Netw Open. 2018. PMID: 30646095 Free PMC article.
-
Application of machine learning approaches in predicting clinical outcomes in older adults - a systematic review and meta-analysis.BMC Geriatr. 2023 Sep 14;23(1):561. doi: 10.1186/s12877-023-04246-w. BMC Geriatr. 2023. PMID: 37710210 Free PMC article.
-
Machine learning to predict adverse drug events based on electronic health records: a systematic review and meta-analysis.J Int Med Res. 2024 Dec;52(12):3000605241302304. doi: 10.1177/03000605241302304. J Int Med Res. 2024. PMID: 39668733 Free PMC article.
Cited by
-
Predicting complete cytoreduction for advanced ovarian cancer patients using nearest-neighbor models.J Ovarian Res. 2020 Sep 29;13(1):117. doi: 10.1186/s13048-020-00700-0. J Ovarian Res. 2020. PMID: 32993745 Free PMC article.
-
Predicting mortality over different time horizons: which data elements are needed?J Am Med Inform Assoc. 2017 Jan;24(1):176-181. doi: 10.1093/jamia/ocw057. Epub 2016 Jun 29. J Am Med Inform Assoc. 2017. PMID: 27357832 Free PMC article.
-
The predictive accuracy of PREDICT: a personalized decision-making tool for Southeast Asian women with breast cancer.Medicine (Baltimore). 2015 Feb;94(8):e593. doi: 10.1097/MD.0000000000000593. Medicine (Baltimore). 2015. PMID: 25715267 Free PMC article.
-
Developing a model to predict unfavourable treatment outcomes in patients with tuberculosis and human immunodeficiency virus co-infection in Delhi, India.PLoS One. 2018 Oct 3;13(10):e0204982. doi: 10.1371/journal.pone.0204982. eCollection 2018. PLoS One. 2018. PMID: 30281679 Free PMC article.
-
UEG Week 2020 Poster Presentations.United European Gastroenterol J. 2020 Oct;8(8_suppl):144-887. doi: 10.1177/2050640620927345. United European Gastroenterol J. 2020. PMID: 33043826 Free PMC article. No abstract available.
References
-
- Huang ML, Hung YH, Lee WM, et al. Usage of case-based reasoning, neural network and adaptive neuro-fuzzy inference system classification techniques in breast cancer dataset classification diagnosis. J Med Syst 2012;36:407–14 - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical