Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission?
- PMID: 35322668
- PMCID: PMC9075435
- DOI: 10.1161/JAHA.121.024198
Abstract
Background: Social risk factors influence rehospitalization rates yet are challenging to incorporate into prediction models. Integration of social risk factors using natural language processing (NLP) and machine learning could improve risk prediction of 30-day readmission following an acute myocardial infarction.
Methods and Results: Patients were enrolled into derivation and validation cohorts. The derivation cohort included inpatient discharges from Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary diagnosis of acute myocardial infarction, who were discharged alive, and not transferred from another facility. The validation cohort included patients from Dartmouth-Hitchcock Health Center between April 2, 2011, and December 31, 2016, meeting the same eligibility criteria. Data from both sites were linked to Centers for Medicare & Medicaid Services administrative data to supplement 30-day hospital readmissions. Clinical notes from each cohort were extracted, and an NLP model was deployed, counting mentions of 7 social risk factors. Five machine learning models were run using clinical and NLP-derived variables. Model discrimination and calibration were assessed, and receiver operating characteristic comparison analyses were performed. The 30-day rehospitalization rates among the derivation (n=6165) and validation (n=4024) cohorts were 15.1% (n=934) and 10.2% (n=412), respectively. The derivation models demonstrated no statistical improvement in model performance with the addition of the selected NLP-derived social risk factors.
Conclusions: Social risk factors extracted using NLP did not significantly improve 30-day readmission prediction among hospitalized patients with acute myocardial infarction. Alternative methods are needed to capture social risk factors.
Keywords: electronic health records; machine learning; myocardial infarction; natural language processing; patient readmission.
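The abstract describes an NLP step that counts mentions of social risk factors in clinical notes and passes those counts to the prediction models as variables. A minimal sketch of that kind of mention counting follows. It is not the authors' pipeline: the abstract does not name the 7 factors or the matching method, so the factor names, the search terms in `LEXICON`, the helper functions, and the example notes are all hypothetical placeholders, and plain keyword matching is swapped in for whatever NLP model the study used.

```python
import re
from collections import Counter

# Hypothetical lexicon: the abstract does not list the 7 social risk factors,
# so these categories and search terms are illustrative placeholders only.
LEXICON = {
    "living_alone": ["lives alone", "living alone"],
    "housing_instability": ["homeless", "unstable housing"],
    "lack_of_transportation": ["no transportation", "lacks transportation"],
}


def compile_patterns(lexicon):
    """Build one case-insensitive, whole-phrase regex per factor."""
    return {
        factor: re.compile(
            r"\b(?:" + "|".join(re.escape(term) for term in terms) + r")\b",
            re.IGNORECASE,
        )
        for factor, terms in lexicon.items()
    }


def count_mentions(notes, patterns):
    """Count mentions of each factor across all notes for one patient."""
    counts = Counter({factor: 0 for factor in patterns})
    for note in notes:
        for factor, pattern in patterns.items():
            counts[factor] += len(pattern.findall(note))
    return dict(counts)


PATTERNS = compile_patterns(LEXICON)

# Made-up example notes, not study data.
example_notes = [
    "Patient lives alone and reports no transportation to clinic.",
    "Pt is currently homeless. Lives alone prior to admission.",
]

# One count per factor, usable as model input features.
features = count_mentions(example_notes, PATTERNS)
print(features)
```

Plain keyword counting of this kind ignores negation and context (for example, "denies living alone" would still be counted), which is one reason a production system would likely need something more robust.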