Developing predictive models using electronic medical records: challenges and pitfalls
- PMID: 24551396
- PMCID: PMC3900132
Developing predictive models using electronic medical records: challenges and pitfalls
Abstract
While Electronic Medical Records (EMR) contain detailed records of the patient-clinician encounter - vital signs, laboratory tests, symptoms, caregivers' notes, interventions prescribed and outcomes - developing predictive models from this data is not straightforward. These data contain systematic biases that violate assumptions made by off-the-shelf machine learning algorithms, commonly used in the literature to train predictive models. In this paper, we discuss key issues and subtle pitfalls specific to building predictive models from EMR. We highlight the importance of carefully considering both the special characteristics of EMR as well as the intended clinical use of the predictive model and show that failure to do so could lead to developing models that are less useful in practice. Finally, we describe approaches for training and evaluating models on EMR using early prediction of septic shock as our example application.
Figures
Similar articles
-
Medical decision support using machine learning for early detection of late-onset neonatal sepsis.J Am Med Inform Assoc. 2014 Mar-Apr;21(2):326-36. doi: 10.1136/amiajnl-2013-001854. Epub 2013 Sep 16. J Am Med Inform Assoc. 2014. PMID: 24043317 Free PMC article.
-
PREDICTIVE MODELING OF HOSPITAL READMISSION RATES USING ELECTRONIC MEDICAL RECORD-WIDE MACHINE LEARNING: A CASE-STUDY USING MOUNT SINAI HEART FAILURE COHORT.Pac Symp Biocomput. 2017;22:276-287. doi: 10.1142/9789813207813_0027. Pac Symp Biocomput. 2017. PMID: 27896982 Free PMC article.
-
Elucidating Discrepancy in Explanations of Predictive Models Developed Using EMR.Stud Health Technol Inform. 2024 Jan 25;310:865-869. doi: 10.3233/SHTI231088. Stud Health Technol Inform. 2024. PMID: 38269932
-
Reporting and Implementing Interventions Involving Machine Learning and Artificial Intelligence.Ann Intern Med. 2020 Jun 2;172(11 Suppl):S137-S144. doi: 10.7326/M19-0872. Ann Intern Med. 2020. PMID: 32479180 Review.
-
Introduction of patient electronic medical records (EMR) into undergraduate nursing education: An integrated literature review.Nurse Educ Today. 2020 Nov;94:104517. doi: 10.1016/j.nedt.2020.104517. Epub 2020 Jun 29. Nurse Educ Today. 2020. PMID: 32853983 Review.
Cited by
-
Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare.NPJ Digit Med. 2022 May 31;5(1):66. doi: 10.1038/s41746-022-00611-y. NPJ Digit Med. 2022. PMID: 35641814 Free PMC article. Review.
-
Data-driven Temporal Prediction of Surgical Site Infection.AMIA Annu Symp Proc. 2015 Nov 5;2015:1164-73. eCollection 2015. AMIA Annu Symp Proc. 2015. PMID: 26958256 Free PMC article.
-
Integrated multisystem analysis in a mental health and criminal justice ecosystem.Health Justice. 2017 Dec;5(1):4. doi: 10.1186/s40352-017-0049-y. Epub 2017 Mar 22. Health Justice. 2017. PMID: 28332099 Free PMC article.
-
Machine Learning and Deep Learning Models for Early Sepsis Prediction: A Scoping Review.Indian J Crit Care Med. 2025 Jun;29(6):516-524. doi: 10.5005/jp-journals-10071-24986. Epub 2025 Jun 5. Indian J Crit Care Med. 2025. PMID: 40567322 Free PMC article. Review.
-
Early Detection of Sepsis With Machine Learning Techniques: A Brief Clinical Perspective.Front Med (Lausanne). 2021 Feb 12;8:617486. doi: 10.3389/fmed.2021.617486. eCollection 2021. Front Med (Lausanne). 2021. PMID: 33644097 Free PMC article.
References
-
- Hug C. Massachusetts Institute of Technology; Cambridge, MA: 2009. Detecting hazardous intensive care patient episodes using real-time mortality models [PhD Thesis]
-
- Fultz SL, Skanderson M, Mole LA, Gandhi N, Bryant K, Crystal S, et al. Development and verification of a “virtual” cohort using the National VA Health Information System. Medical care. 2006;44(8):S25–S30. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical