Am J Emerg Med. 2025 Jul;93:120-125.
doi: 10.1016/j.ajem.2025.03.065. Epub 2025 Apr 1.

Automated computation of the HEART score with the GPT-4 large language model

Donald S Wright et al. Am J Emerg Med. 2025 Jul.

Abstract

Background: Automated computation of the HEART score has the potential to facilitate clinical decision support and safety interventions. The goal of this study was to assess the performance of the GPT-4 large language model (LLM) in computation of the HEART score and prediction of 60-day major adverse cardiac events (MACE).

Methods: In this retrospective cohort study from February 2022 to September 2023, patients admitted to a chest pain observation unit were identified. HEART scores were calculated by an advanced practice provider (APP; a physician assistant or nurse practitioner) as part of routine care. Separately, the LLM calculated a HEART score from deidentified chart documentation using an iteratively developed prompt. Any cases of disagreement with the APP score were adjudicated by an emergency physician blinded to clinical outcomes. Agreement on HEART score was assessed, and 60-day MACE was obtained via linkage to an institutional registry.
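For readers unfamiliar with the score itself, the arithmetic can be sketched as follows. This is a minimal illustration based on the commonly published HEART definition (History, ECG, Age, Risk factors, Troponin, each scored 0 to 2), not the study's prompt or pipeline. The `HeartInputs` class, its field names, and the helper functions are hypothetical, and the subjective History and ECG components are assumed to be already graded by a reader.

```python
from dataclasses import dataclass


@dataclass
class HeartInputs:
    """Hypothetical container for the five HEART components."""
    history: int                 # 0 slightly, 1 moderately, 2 highly suspicious
    ecg: int                     # 0 normal, 1 nonspecific changes, 2 significant ST deviation
    age_years: int
    risk_factor_count: int       # number of cardiovascular risk factors
    known_atherosclerosis: bool  # history of atherosclerotic disease
    troponin_ratio: float        # measured troponin divided by the upper normal limit


def age_points(age_years: int) -> int:
    # <45 scores 0, 45-64 scores 1, >=65 scores 2
    if age_years < 45:
        return 0
    return 1 if age_years < 65 else 2


def risk_factor_points(count: int, known_atherosclerosis: bool) -> int:
    # >=3 risk factors or known atherosclerotic disease scores 2; 1-2 scores 1
    if known_atherosclerosis or count >= 3:
        return 2
    return 1 if count >= 1 else 0


def troponin_points(ratio: float) -> int:
    # at or below the normal limit scores 0, up to 3x scores 1, above 3x scores 2
    if ratio <= 1.0:
        return 0
    return 1 if ratio <= 3.0 else 2


def heart_score(x: HeartInputs) -> int:
    """Sum the five components; the total ranges from 0 to 10."""
    if x.history not in (0, 1, 2) or x.ecg not in (0, 1, 2):
        raise ValueError("history and ecg must each be 0, 1, or 2")
    return (
        x.history
        + x.ecg
        + age_points(x.age_years)
        + risk_factor_points(x.risk_factor_count, x.known_atherosclerosis)
        + troponin_points(x.troponin_ratio)
    )


def risk_category(score: int) -> str:
    # Conventional bands: 0-3 low, 4-6 moderate, 7-10 high
    if score <= 3:
        return "low"
    return "moderate" if score <= 6 else "high"
```

The History and ECG components require clinical judgment, which is where a chart-reading model and a clinician would be expected to diverge; the remaining three components are essentially lookups.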

Results: Of the 601 participants, 50 were utilized for prompt development. Among the remaining 551 participants, agreement by Cohen's weighted kappa between the LLM and adjudicators was 0.67, which was similar to the agreement of 0.66 between the APP and adjudicators. The LLM predicted a higher average HEART score (mean 5.06) than the adjudicators (mean 4.69) or APP (mean 4.23). No significant difference was seen in diagnostic performance for 60-day MACE by DeLong pairwise comparison (all p > .05).
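The agreement statistic reported above can be illustrated with a short sketch of Cohen's weighted kappa for ordinal scores. The abstract does not say whether linear or quadratic weights were used, so the function below supports both and defaults to linear as an assumption. The function name, its parameters, and the example ratings are hypothetical and are not study data.

```python
from typing import Sequence


def weighted_kappa(
    rater_a: Sequence[int],
    rater_b: Sequence[int],
    n_categories: int = 11,   # HEART totals run from 0 to 10
    quadratic: bool = False,  # assumption: linear weights unless stated otherwise
) -> float:
    """Cohen's weighted kappa: 1 - (observed disagreement / expected disagreement)."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters need the same non-zero number of ratings")
    n = len(rater_a)
    k = n_categories

    # Joint proportion table: observed[i][j] is the share of cases
    # scored i by rater A and j by rater B.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1.0 / n

    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def weight(i: int, j: int) -> float:
        # Disagreement weight grows with the distance between scores.
        d = abs(i - j) / (k - 1)
        return d * d if quadratic else d

    observed_dis = sum(weight(i, j) * observed[i][j] for i in range(k) for j in range(k))
    expected_dis = sum(weight(i, j) * row[i] * col[j] for i in range(k) for j in range(k))
    if expected_dis == 0:
        raise ValueError("kappa is undefined when every rating falls in one category")
    return 1.0 - observed_dis / expected_dis


# Made-up ratings for illustration only.
llm_scores = [3, 5, 6, 4, 7, 2]
adjudicated = [3, 4, 6, 4, 6, 2]
kappa = weighted_kappa(llm_scores, adjudicated)
```

Because the weights penalize a one-point disagreement far less than a five-point one, weighted kappa suits an ordinal scale such as the 0 to 10 HEART total better than unweighted kappa, which treats all disagreements alike.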

Conclusions: Automated risk score computation with language models has the potential to power interventions such as clinical decision support but has systematic differences from physician judgment. Prospective investigation is needed.

Keywords: Artificial Intelligence; ChatGPT; GPT; HEART Score; LLM; Large language model.

Conflict of interest statement

Declaration of competing interest: None.
