Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2024 Winter;26(2):101068.
doi: 10.1016/j.jocmr.2024.101068. Epub 2024 Jul 28.

Generative Pre-trained Transformer 4 analysis of cardiovascular magnetic resonance reports in suspected myocarditis: A multicenter study

Affiliations
Multicenter Study

Generative Pre-trained Transformer 4 analysis of cardiovascular magnetic resonance reports in suspected myocarditis: A multicenter study

Kenan Kaya et al. J Cardiovasc Magn Reson. 2024 Winter.

Abstract

Background: Diagnosing myocarditis relies on multimodal data, including cardiovascular magnetic resonance (CMR), clinical symptoms, and blood values. The correct interpretation and integration of CMR findings require radiological expertise and knowledge. We aimed to investigate the performance of Generative Pre-trained Transformer 4 (GPT-4), a large language model, for report-based medical decision-making in the context of cardiac MRI for suspected myocarditis.

Methods: This retrospective study includes CMR reports from 396 patients with suspected myocarditis and eight centers, respectively. CMR reports and patient data including blood values, age, and further clinical information were provided to GPT-4 and radiologists with 1 (resident 1), 2 (resident 2), and 4 years (resident 3) of experience in CMR and knowledge of the 2018 Lake Louise Criteria. The final impression of the report regarding the radiological assessment of whether myocarditis is present or not was not provided. The performance of Generative pre-trained transformer 4 (GPT-4) and the human readers were compared to a consensus reading (two board-certified radiologists with 8 and 10 years of experience in CMR). Sensitivity, specificity, and accuracy were calculated.

Results: GPT-4 yielded an accuracy of 83%, sensitivity of 90%, and specificity of 78%, which was comparable to the physician with 1 year of experience (R1: 86%, 90%, 84%, p = 0.14) and lower than that of more experienced physicians (R2: 89%, 86%, 91%, p = 0.007 and R3: 91%, 85%, 96%, p < 0.001). GPT-4 and human readers showed a higher diagnostic performance when results from T1- and T2-mapping sequences were part of the reports, for residents 1 and 3 with statistical significance (p = 0.004 and p = 0.02, respectively).

Conclusion: GPT-4 yielded good accuracy for diagnosing myocarditis based on CMR reports in a large dataset from multiple centers and therefore holds the potential to serve as a diagnostic decision-supporting tool in this capacity, particularly for less experienced physicians. Further studies are required to explore the full potential and elucidate educational aspects of the integration of large language models in medical decision-making.

Keywords: Artificial intelligence; Cardiovascular magnetic resonance; Generative Pre-trained Transformer 4; Large language models; Myocarditis.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interests The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: David Maintz received speaker’s honoraria from Philips Healthcare. Jan Borggrefe received speaker’s honoraria from Siemens Healthineers. Simon Lennartz is a member of Editorial Board of Radiology and a Senior Deputy Editor of Radiology in Training. Otherwise, the authors declare no conflicts of interest and had full control over all data, and guarantee correctness.

Figures

ga1
Graphical abstract
Fig. 1
Fig. 1
Exemplary styles of reports being included in this study: free text with T1/T2-mapping, free text without T1/T2-mapping, structured report with T1- and T2-mapping, and structured report without T1/T2-mapping. LV left ventricular, EDV end-diastolic volume, ESV end-systolic volume, EF ejection fraction, CO cardiac output, CI cardiac index, HR heart rate, CT computed tomography, LV EDD left ventricular end-diastolic diameter, BSA body surface area, ED end-diastole, LGE late gadolinium enhancement, GPT-4 Generative Pre-trained Transformer 4.
Fig. 2
Fig. 2
Workflow of the study design. The reference standard was established by the diagnosis of myocarditis based on the assessment of two board-certified radiologists with 8 and 10 years of experience in cardiovascular imaging, respectively. GPT-4 Generative Pre-trained Transformer 4.
Fig. 3
Fig. 3
Confusion matrices for performance of GPT-4, residents 1, 2, and 3 compared to the reference standard. GPT-4: Generative Pre-trained Transformer 4.
Fig. 4
Fig. 4
Proofreading examples by GPT-4 based on the given clinical data, laboratory values, and the radiology report compared to the assessment of the human readers. CRP C-reactive protein, CK creatine kinase, CK-MB creatine kinase-MB, Hs-Trop high sensitive troponin. IVSD interventricular septum thickness. LV left ventricular, EDV end-diastolic volume, EF ejection fraction, BSA body surface area, GPT-4 Generative Pre-trained Transformer 4, CMR cardiovascular magnetic resonance .

References

    1. Phillips M., Robinowitz M., Higgins J.R., Boran K.J., Reed T., Virmani R. Sudden cardiac death in Air Force recruits. A 20-year review. JAMA. 1986;256:2696–2699. - PubMed
    1. Liu P.P., Mason J.W. Advances in the understanding of myocarditis. Circulation. 2001;104:1076–1082. doi: 10.1161/hc3401.095198. - DOI - PubMed
    1. Caforio A.L., Pankuweit S., Arbustini E., Basso C., Gimeno-Blanes J., Felix S.B., et al. Current state of knowledge on aetiology, diagnosis, management, and therapy of myocarditis: a position statement of the European Society of Cardiology Working Group on Myocardial and Pericardial Diseases. Eur Heart J. 2013;34:2636–2648. doi: 10.1093/eurheartj/eht210. - DOI - PubMed
    1. Friedrich M.G., Sechtem U., Schulz-Menger J., Holmvang G., Alakija P., Cooper LT., et al. Cardiovascular magnetic resonance in myocarditis: a JACC White Paper. J Am Coll Cardiol. 2009;53:1475–1487. doi: 10.1016/j.jacc.2009.02.007. - DOI - PMC - PubMed
    1. Kotanidis C.P., Bazmpani M.A., Haidich A.B., Karvounis C., Antoniades C., Karamitsos T.D. Diagnostic accuracy of cardiovascular magnetic resonance in acute myocarditis: a systematic review and meta-analysis. JACC Cardiovasc Imaging. 2018 Nov;11:1583–1590. doi: 10.1016/j.jcmg.2017.12.008. - DOI - PubMed

Publication types

LinkOut - more resources