Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug;644(8075):221-230.
doi: 10.1038/s41586-025-09227-0. Epub 2025 Jul 16.

Detecting structural heart disease from electrocardiograms using AI

Affiliations

Detecting structural heart disease from electrocardiograms using AI

Timothy J Poterucha et al. Nature. 2025 Aug.

Abstract

Early detection of structural heart disease is critical to improving outcomes, but widespread screening remains limited by the cost and accessibility of imaging tools such as echocardiography1,2. Recent advances in machine learning applied to heart rhythm recordings have shown promise in identifying disease3,4, although previous work has been limited by development in narrow populations or targeting only select heart conditions5. Here we introduce a deep learning model, EchoNext, trained on more than 1 million heart rhythm and imaging records across a large and diverse health system to detect many forms of structural heart disease. The model demonstrated high diagnostic accuracy in internal and external validation, outperforming cardiologists in a controlled evaluation and showing consistent performance across different care settings and racial and/or ethnic groups. The models were prospectively evaluated in a clinical trial of patients without previous cardiac imaging, successfully identifying previously undiagnosed heart disease. These findings support the potential of artificial intelligence to expand access to heart disease screening at scale. To enable further development and transparency, we have publicly released model weights and a large, annotated dataset linking heart rhythm data to imaging-based diagnoses.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Columbia University has submitted a patent application (no. 63/555,968) on the EchoNext ECG algorithm, on which T.J.P., L.J., C.M.H. and P.E. are inventors. T.J.P. owns stock in Abbott Laboratories and Baxter International, with research support provided to his institution from Eidos Therapeutics, Janssen, Pfizer and Edwards Lifesciences. E.M.D. serves on a clinical trial committee for Abiomed. R.T.H. has institutional consulting contracts with Abbott Structural, Edwards Lifesciences, Medtronic and Novartis, for which she receives no direct compensation. M.A.P. received a one-time research donation from Roche Diagnostics in 2023. B.D. is a consultant for Hone Health. R.A. and G.H.T. are co-inventors on the patent 63/208,406 (Method and System for Automated Analysis of Coronary Angiograms). R.A. reports receiving speaker fees from Abbott, Boston Scientific, Boehringer-Ingelheim and Novartis. G.H.T. is an adviser to Prolaio and Viz.ai, and has received research grants from Janssen Pharmaceuticals, General Electric and Myokardia, a wholly owned subsidiary of Bristol Meyers Squibb. P.E. has research support provided to his institution from Eidos Therapeutics, Pfizer, Janssen, Edwards Lifesciences and Google. The other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Model development: NYP multicentre cohort derivation.
The deep learning model was trained and tested using data from an eight-hospital system (NYP Hospital). ECG data were accessed using the MUSE system with removal of ECGs with missing age, sex and patient identifier, poor study quality designation by machine recommending repeating of ECG or presence of ventricular pacing. Echocardiogram data were accessed using hospital systems with removal of patients with repaired or replaced heart failures. This yielded 1.2 million ECG–echocardiogram pairs in 230,018 unique patients with data split into train, validation and test sets.
Fig. 2
Fig. 2. Multicentre EchoNext performance.
Performance of the model in detection of individual and compositive SHDs. a,b, By AUROC (a) and AUPRC (b), the model had high performance in detection of SHD in the internal eight-hospital NYP system test set and three geographically distinct external test sets (Montreal Heart, Cedars-Sinai and University of California San Francisco (UCSF)). c, Individual disease models had the highest performance in the detection of reduced low left ventricular (LV) and right ventricular (RV) systolic function by AUROC with favourable performance for other disease states. d, Assessment of the AUPRC for the individual disease states is highly dependent on the underlying prevalence of the individual disease states. TR, tricuspid regurgitation. Dashed lines (a,c) indicate random classifier.
Fig. 3
Fig. 3. Performance characteristics of EchoNext in retrospective validation, comparison to cardiologists, silent deployment, and clinical trial.
a, In test sets of held-out patients at these sites, as well as in three geographically distinct external test sets, the model demonstrated high accuracy, as demonstrated by AUROC. Dashed line indicates random classifier. b, In a survey of ECGs shown to cardiologists to assess for the presence of SHD, the AI model demonstrated superior performance in SHD detection compared with cardiologists alone or cardiologists given the EchoNext risk score (n = 3,200 cardiologist interpretations). Error bars show the CIs derived from results across 13 cardiologists; because the AI model was run once on the entire set of 150 ECGs, there are no error bars for the ‘AI alone’ results. c, This model was evaluated in a temporally distinct held-out set with similar accuracy, with 45% (n = 3,444) of patients labelled as high risk by the model failing to undergo echocardiography as part of routine clinical care. d, The clinical use of AI-ECG to detect SHD was evaluated in a single-arm, single-site, open-label pilot clinical trial, DISCOVERY, with stratified recruitment of patients (N = 100) with an ECG but no previous echocardiogram. This trial used a related ECG model, ValveNet, which was trained to detect left-sided VHD (Left-VHD) with patients selected stratified sampling by their AI-ECG scores. This trial showed a high-level of discrimination in the detection of Left-VHD (primary endpoint) and SHD (secondary endpoint), and post hoc assessment using the second-generation EchoNext model demonstrated an even greater degree of risk stratification with 73% of patients in the highest risk and 6% of patients in the lowest-risk groups being found to have SHD. Left-VHD, moderate or severe aortic stenosis, aortic regurgitation or mitral regurgitation; SHD, LVEF less than or equal to 45%, low left ventricular wall thickness greater than or equal to 1.3 cm, moderate or severe right ventricular dysfunction, any moderate or severe VHD, PASP greater than or equal to 45 mm Hg or a moderate or large pericardial effusion.
Fig. 4
Fig. 4. Characteristics of the released Columbia ECG dataset and performance of the Columbia mini-model trained on these data for SHD prediction.
A 100,000 ECG dataset is being released from data from Columbia University Irving Medical Center as part of this paper. These data consist of the ECG waveform, ECG tabular features and paired echocardiographic data. A model trained and tested on this dataset demonstrated similar performance in SHD detection as the multisite model that served as the primary analysis in this study when assessed by AUROC and AUPRC. Dashed line indicates random classifier.

References

    1. Otto, C. M. et al. 2020 ACC/AHA guideline for the management of patients with valvular heart disease: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation143, e72–e227 (2021). - PubMed
    1. Heidenreich, P. A. et al. 2022 AHA/ACC/HFSA guideline for the management of heart failure: executive summary: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. J. Am. Coll. Cardiol.79, 1757–1780 (2022). - PubMed
    1. Elias, P. et al. Deep learning electrocardiographic analysis for detection of left-sided valvular heart disease. J. Am. Coll. Cardiol.80, 613–626 (2022). - PubMed
    1. Ulloa-Cerna, A. E. et al. rECHOmmend: an ECG-based machine learning approach for identifying patients at increased risk of undiagnosed structural heart disease detectable by echocardiography. Circulation146, 36–47 (2022). - PMC - PubMed
    1. Siontis, K. C., Noseworthy, P. A., Attia, Z. I. & Friedman, P. A. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management. Nat. Rev. Cardiol.18, 465–478 (2021). - PMC - PubMed

LinkOut - more resources