JAMA Netw Open. 2018 Dec 7;1(8):e185097.

doi: 10.1001/jamanetworkopen.2018.5097.

Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data

Ben J Marafino¹ ² ³, Miran Park¹ ², Jason M Davies¹ ² ⁴ ⁵, Robert Thombley¹ ², Harold S Luft⁶, David C Sing¹ ² ⁷, Dhruv S Kazi⁸ ⁹ ¹⁰, Colette DeJong¹ ², W John Boscardin⁹, Mitzi L Dean¹ ², R Adams Dudley¹ ² ¹⁰

Affiliations

¹ Philip R. Lee Institute for Health Policy Studies, School of Medicine, University of California, San Francisco.
² Center for Healthcare Value, University of California, San Francisco.
³ currently with Biomedical Informatics Training Program, Stanford University School of Medicine, Stanford, California.
⁴ Department of Neurological Surgery, University of California, San Francisco.
⁵ Departments of Neurosurgery and Biomedical Informatics, University of Buffalo, Buffalo, New York.
⁶ Palo Alto Medical Foundation Research Institute, Palo Alto, California.
⁷ Department of Orthopedic Surgery, Boston Medical Center, Boston, Massachusetts.
⁸ Division of Cardiology, Zuckerberg San Francisco General Hospital, San Francisco, California.
⁹ Department of Epidemiology and Biostatistics, University of California, San Francisco.
¹⁰ Department of Medicine, University of California, San Francisco.

PMID: 30646310
PMCID: PMC6324323
DOI: 10.1001/jamanetworkopen.2018.5097

Ben J Marafino et al. JAMA Netw Open. 2018.

Abstract

Importance: Accurate prediction of outcomes among patients in intensive care units (ICUs) is important for clinical research and monitoring care quality. Most existing prediction models do not take full advantage of the electronic health record, using only the single worst value of laboratory tests and vital signs and largely ignoring information present in free-text notes. Whether capturing more of the available data and applying machine learning and natural language processing (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown.

Objectives: To evaluate the change in power for a mortality prediction model among patients in the ICU achieved by incorporating measures of clinical trajectory together with NLP of clinical text and to assess the generalizability of this approach.

Design, setting, and participants: This retrospective cohort study included 101 196 patients with a first-time admission to the ICU and a length of stay of at least 4 hours. Twenty ICUs at 2 academic medical centers (University of California, San Francisco [UCSF], and Beth Israel Deaconess Medical Center [BIDMC], Boston, Massachusetts) and 1 community hospital (Mills-Peninsula Medical Center [MPMC], Burlingame, California) contributed data from January 1, 2001, through June 1, 2017. Data were analyzed from July 1, 2017, through August 1, 2018.

Main outcomes and measures: In-hospital mortality and model discrimination as assessed by the area under the receiver operating characteristic curve (AUC) and model calibration as assessed by the modified Hosmer-Lemeshow statistic.
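
As an illustration of the two measures named above, the following is a minimal Python sketch, not the authors' code. It computes the AUC as the probability that a randomly chosen patient who died received a higher predicted risk than a randomly chosen survivor, and it computes the standard Hosmer-Lemeshow chi-square over equal-size risk groups. The abstract refers to a modified Hosmer-Lemeshow statistic whose details are not given here, so the standard form is used as a stand-in. The function names and the default of 10 groups are assumptions made for this example.

```python
from typing import Sequence


def auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as P(score of a positive > score of a negative); ties count as 1/2."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def hosmer_lemeshow(y_true: Sequence[int], y_prob: Sequence[float],
                    groups: int = 10) -> float:
    """Standard Hosmer-Lemeshow chi-square (assumed stand-in for the modified form)."""
    # Sort patients by predicted risk, then split into equal-size groups.
    pairs = sorted(zip(y_prob, y_true))
    n = len(pairs)
    stat = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        if not chunk:
            continue
        size = len(chunk)
        expected = sum(p for p, _ in chunk)   # expected deaths in the group
        observed = sum(y for _, y in chunk)   # observed deaths in the group
        mean_p = expected / size
        denom = size * mean_p * (1.0 - mean_p)
        if denom > 0:
            stat += (observed - expected) ** 2 / denom
    return stat
```

Calling `auc(labels, risks)` on held-out predictions gives the discrimination measure, while a smaller `hosmer_lemeshow(labels, risks)` value indicates closer agreement between predicted and observed mortality within risk groups.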

Results: Among 101 196 patients included in the analysis, 51.3% (n = 51 899) were male, with a mean (SD) age of 61.3 (17.1) years; their in-hospital mortality rate was 10.4% (n = 10 505). A baseline model using only the highest and lowest observed values for each laboratory test result or vital sign achieved a cross-validated AUC of 0.831 (95% CI, 0.830-0.832). In contrast, that model augmented with measures of clinical trajectory achieved an AUC of 0.899 (95% CI, 0.896-0.902; P < .001 for AUC difference). Adding NLP-derived terms associated with mortality to this model further increased the AUC to 0.922 (95% CI, 0.916-0.924; P < .001). These NLP-derived terms were associated with improved model performance even when applied across sites (AUC difference for UCSF: 0.077 to 0.021; AUC difference for MPMC: 0.071 to 0.051; AUC difference for BIDMC: 0.035 to 0.043; P < .001) when augmenting with NLP at each site.
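
The difference between the baseline features (highest and lowest observed values) and trajectory-style features can be sketched as follows. The abstract does not specify which trajectory measures were used, so the least-squares slope and the net change from first to last observation are assumptions chosen purely for illustration, as is the function name `summarize_series`.

```python
from typing import Dict, List, Tuple


def summarize_series(obs: List[Tuple[float, float]]) -> Dict[str, float]:
    """Summarize one laboratory test or vital sign for one patient.

    obs holds (hours_since_admission, value) pairs.
    """
    if not obs:
        raise ValueError("at least one observation is required")
    obs = sorted(obs)
    times = [t for t, _ in obs]
    values = [v for _, v in obs]

    # Baseline-style features: extreme values only.
    features = {"min": min(values), "max": max(values)}

    # Hypothetical trajectory features: least-squares slope over time
    # and net change between the first and last observation.
    t_mean = sum(times) / len(times)
    v_mean = sum(values) / len(values)
    var_t = sum((t - t_mean) ** 2 for t in times)
    cov_tv = sum((t - t_mean) * (v - v_mean) for t, v in zip(times, values))
    features["slope"] = cov_tv / var_t if var_t > 0 else 0.0
    features["last_minus_first"] = values[-1] - values[0]
    return features
```

Two patients with the same minimum and maximum for a test can have opposite slopes, one improving and one deteriorating. That is the kind of information a model restricted to extreme values cannot see.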

Conclusions and relevance: Intensive care unit mortality prediction models incorporating measures of clinical trajectory and NLP-derived terms yielded excellent predictive performance and generalized well in this sample of hospitals. The role of these automated algorithms, particularly those using unstructured data from notes and other sources, in clinical research and quality improvement seems to merit additional investigation.

Conflict of interest statement

Conflict of Interest Disclosures: None reported.

Comment in

Can Big Data Deliver on Its Promises?-Leaps but Not Bounds.
Peltan ID, Beesley SJ, Brown SM. JAMA Netw Open. 2018 Dec 7;1(8):e185694. doi: 10.1001/jamanetworkopen.2018.5694. PMID: 30646305.

Grants and funding

T15 LM007033/LM/NLM NIH HHS/United States
