Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 5;11(7):e024198.
doi: 10.1161/JAHA.121.024198. Epub 2022 Mar 24.

Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission?

Affiliations

Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission?

Jeremiah R Brown et al. J Am Heart Assoc. .

Abstract

Background Social risk factors influence rehospitalization rates yet are challenging to incorporate into prediction models. Integration of social risk factors using natural language processing (NLP) and machine learning could improve risk prediction of 30-day readmission following an acute myocardial infarction. Methods and Results Patients were enrolled into derivation and validation cohorts. The derivation cohort included inpatient discharges from Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary diagnosis of acute myocardial infarction, who were discharged alive, and not transferred from another facility. The validation cohort included patients from Dartmouth-Hitchcock Health Center between April 2, 2011, and December 31, 2016, meeting the same eligibility criteria described above. Data from both sites were linked to Centers for Medicare & Medicaid Services administrative data to supplement 30-day hospital readmissions. Clinical notes from each cohort were extracted, and an NLP model was deployed, counting mentions of 7 social risk factors. Five machine learning models were run using clinical and NLP-derived variables. Model discrimination and calibration were assessed, and receiver operating characteristic comparison analyses were performed. The 30-day rehospitalization rates among the derivation (n=6165) and validation (n=4024) cohorts were 15.1% (n=934) and 10.2% (n=412), respectively. The derivation models demonstrated no statistical improvement in model performance with the addition of the selected NLP-derived social risk factors. Conclusions Social risk factors extracted using NLP did not significantly improve 30-day readmission prediction among hospitalized patients with acute myocardial infarction. Alternative methods are needed to capture social risk factors.

Keywords: electronic health records; machine learning; myocardial infarction; natural language processing; patient readmission.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Percentage calibrated for test on Vanderbilt University Medical Center (VUMC) using structured clinical data (SCD) only and SCD with natural language processing–derived social risk factors (main effects) (NSRF), scored on Dartmouth‐Hitchcock Medical Center (DHMC).
Bars represent the percentage of aligned risk predictions. Model 1, SCD. Model 2, SCD+NSRF. Default, models with untuned hyperparameters. Optimized, models with tuned hyperparameters. LASSO indicates least absolute shrinkage and selection operator.

Similar articles

Cited by

References

    1. Mozaffarian D, Benjamin EJ, Go AS, Arnett DK, Blaha MJ, Cushman M, de Ferranti S, Després J‐P, Fullerton HJ, Howard VJ, et al. Executive summary: heart disease and stroke statistics—2015 update. Circulation. 2015;131:434–441. doi: 10.1161/cir.0000000000000157 - DOI - PubMed
    1. Khera R, Jain S, Pandey A, Agusala V, Kumbhani DJ, Das SR, Berry JD, de Lemos JA, Girotra S. Comparison of readmission rates after acute myocardial infarction in 3 patient age groups (18 to 44, 45 to 64, and >/=65 years) in the United States. Am J Cardiol. 2017;120:1761–1767. doi: 10.1016/j.amjcard.2017.07.081 - DOI - PMC - PubMed
    1. Bernheim SM, Parzynski CS, Horwitz L, Lin Z, Araas MJ, Ross JS, Drye EE, Suter LG, Normand SL, Krumholz HM. Accounting for patients' socioeconomic status does not change hospital readmission rates. Health Aff (Millwood). 2016;35:1461–1470. doi: 10.1377/hlthaff.2015.0394 - DOI - PMC - PubMed
    1. Wasfy JH, Bhambhani V, Healy EW, Choirat C, Dominici F, Wadhera RK, Shen C, Wang Y, Yeh RW, et al. Relative effects of the hospital readmissions reduction program on hospitals that serve poorer patients. Med Care. 2019;57:968–976. doi: 10.1097/MLR.0000000000001207 - DOI - PMC - PubMed
    1. Navathe AS, Zhong F, Lei VJ, Chang FY, Sordo M, Topaz M, Navathe SB, Rocha RA, Zhou L. Hospital readmission and social risk factors identified from physician notes. Health Serv Res. 2018;53:1110–1136. doi: 10.1111/1475-6773.12670 - DOI - PMC - PubMed

Publication types