This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2025 Mar 5:2025.03.02.25322575.

doi: 10.1101/2025.03.02.25322575.

Foundation models for generalizable electrocardiogram interpretation: comparison of supervised and self-supervised electrocardiogram foundation models

Alexis Nolin-Lapalme^{1

2

3

4}, Achille Sowa^{1

2

4}, Jacques Delfrate^{2

4}, Olivier Tastet², Denis Corbin², Merve Kulbay², Derman Ozdemir^{2

5}, Marie-Jeanne Noël², François-Christophe Marois-Blanchet⁶, François Harvey⁶, Surbhi Sharma⁷, Minhaj Ansari⁸, I-Min Chiu⁹, Valentina Dsouza¹⁰, Sam F Friedman¹⁰, Michaël Chassé⁶, Brian J Potter¹¹, Jonathan Afilalo¹², Pierre Adil Elias¹³, Gilbert Jabbour², Mourad Bahani², Marie-Pierre Dubé², Patrick M Boyle⁷, Neal A Chatterjee⁷, Joshua Barrios⁸, Geoffrey H Tison⁸, David Ouyang⁹, Mahnaz Maddah¹⁰, Shaan Khurshid^{14

15

16}, Julia Cadrin-Tourigny², Rafik Tadros², Julie Hussin^{1

2

3}, Robert Avram^{1

2

4}

Affiliations

¹ Department of Biochemistry and Molecular Medicine, Faculty of Medicine, University of Montreal, Montreal, Quebec, Canada.
² Montreal Heart Institute, Department of Medicine, Montreal, Quebec, Canada.
³ Mila - Québec AI Institute, Montreal, Quebec, Canada.
⁴ Heartwise (heartwise.ai), Montreal Heart Institute, Montreal, Quebec, Canada.
⁵ Department of Medicine, Eastern New Mexico Medical Center, Roswell, NM, USA.
⁶ Centre Hospitalier de l'Université de Montréal (CHUM) Center for the Integration and Analysis of Medical Data (CITADEL), Montreal, Canada.
⁷ University of Washington, Seattle, Washington, United States.
⁸ Division of Cardiology, Department of Medicine, University of California-San Francisco, San Francisco, California, USA.
⁹ Department of Cardiology, Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
¹⁰ Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹¹ Centre Hospitalier de l'Université de Montréal (CHUM) Cardiovascular Center & Research Center, Montreal, Canada.
¹² Division of Cardiology and Centre for Clinical Epidemiology, Jewish General Hospital, McGill University, Montreal, Quebec, Canada.
¹³ Department of Biomedical Informatics, Columbia University Irving Medical Center, NY, USA.
¹⁴ Cardiovascular Disease Initiative, Broad Institute of Harvard University and MIT, Cambridge, MA, USA.
¹⁵ Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.
¹⁶ Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA.

PMID: 40093248
PMCID: PMC11908279
DOI: 10.1101/2025.03.02.25322575

Foundation models for generalizable electrocardiogram interpretation: comparison of supervised and self-supervised electrocardiogram foundation models

Alexis Nolin-Lapalme et al. medRxiv. 2025.

[Preprint]. 2025 Mar 5:2025.03.02.25322575.

doi: 10.1101/2025.03.02.25322575.

Authors

Affiliations

¹ Department of Biochemistry and Molecular Medicine, Faculty of Medicine, University of Montreal, Montreal, Quebec, Canada.
² Montreal Heart Institute, Department of Medicine, Montreal, Quebec, Canada.
³ Mila - Québec AI Institute, Montreal, Quebec, Canada.
⁴ Heartwise (heartwise.ai), Montreal Heart Institute, Montreal, Quebec, Canada.
⁵ Department of Medicine, Eastern New Mexico Medical Center, Roswell, NM, USA.
⁶ Centre Hospitalier de l'Université de Montréal (CHUM) Center for the Integration and Analysis of Medical Data (CITADEL), Montreal, Canada.
⁷ University of Washington, Seattle, Washington, United States.
⁸ Division of Cardiology, Department of Medicine, University of California-San Francisco, San Francisco, California, USA.
⁹ Department of Cardiology, Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
¹⁰ Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
¹¹ Centre Hospitalier de l'Université de Montréal (CHUM) Cardiovascular Center & Research Center, Montreal, Canada.
¹² Division of Cardiology and Centre for Clinical Epidemiology, Jewish General Hospital, McGill University, Montreal, Quebec, Canada.
¹³ Department of Biomedical Informatics, Columbia University Irving Medical Center, NY, USA.
¹⁴ Cardiovascular Disease Initiative, Broad Institute of Harvard University and MIT, Cambridge, MA, USA.
¹⁵ Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.
¹⁶ Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA.

PMID: 40093248
PMCID: PMC11908279
DOI: 10.1101/2025.03.02.25322575

Abstract

Background: The 12-lead electrocardiogram (ECG) remains a cornerstone of cardiac diagnostics, yet existing artificial intelligence (AI) solutions for automated interpretation often lack generalizability, remain closed-source, and are primarily trained using supervised learning, limiting their adaptability across diverse clinical settings. To address these challenges, we developed and compared two open-source foundational ECG models: DeepECG-SSL, a self-supervised learning model, and DeepECG-SL, a supervised learning model.

Methods: Both models were trained on over 1 million ECGs using a standardized preprocessing pipeline and automated free-text extraction from ECG reports to predict 77 cardiac conditions. DeepECG-SSL was pretrained using self-supervised contrastive learning and masked lead modeling. The models were evaluated on six multilingual private healthcare systems and four public datasets for ECG interpretation across 77 diagnostic categories. Fairness analyses assessed disparities in performance across age and sex groups, while also investigating fairness and resource utilization.

Results: DeepECG-SSL achieved AUROCs of 0.990 (95%CI 0.990, 0.990) on internal dataset, 0.981 (95%CI 0.981, 0.981) on external public datasets, and 0.983 (95%CI 0.983, 0.983) on external private datasets, while DeepECG-SL demonstrated AUROCs of 0.992 (95%CI 0.992, 0.992), 0.980 (95%CI 0.980, 0.980) and 0.983 (95%CI 0.983, 0.983) respectively. Fairness analyses revealed minimal disparities (true positive rate & false positive rate difference<0.010) across age and sex groups. Digital biomarker prediction (Long QT syndrome (LQTS) classification, 5-year atrial fibrillation prediction and left ventricular ejection fraction (LVEF) classification) with limited labeled data, DeepECG-SSL outperformed DeepECG-SL in predicting 5-year atrial fibrillation risk (N=132,050; AUROC 0.742 vs. 0.720; Δ=0.022; P<0.001), identifying reduced LVEF ≤40% (N=25,252; 0.928 vs. 0.900; Δ=0.028; P<0.001), and classifying LQTS syndrome subtypes (N=127; 0.931 vs. 0.853; Δ=0.078; P=0.026).

Conclusion: By releasing model weights, preprocessing tools, and validation code, we aim to support robust, data-efficient AI diagnostics across diverse clinical environments. This study establishes self-supervised learning as a promising paradigm for ECG analysis, particularly in settings with limited annotated data, enhancing accessibility, generalizability, and fairness in AI-driven cardiac diagnostics.

Keywords: Artificial intelligence; Electrocardiogram; Fairness; Foundation model; Generalizability; Privacy.

PubMed Disclaimer

Figures

**Figure 1.. An overview of DeepECG-SL and DeepECG-SSL performances.**
We report overall AUROC and P-value computed using the DeLong Test. A Overall AUROC of ECG interpretation across all datasets. B AUROC of ECG interpretation categories on MHI-ds, EPD and EHC. C. AUROC of ECG interpretation categories on MHI-ds, EPD and EHC.D AUROC of digital biomarkers across datasets. iAF5 is at ECG level **Abbreviations**: **CSH** (Cedars Sinai Hospital Dataset), **CLSA** (Canadian Longitudinal Study on Aging Dataset), **ECG** (Electrocardiogram), **EHC** (External Health Centers Dataset), **EPD** (External Public Dataset), **JGH** (Jewish General Hospital Dataset), **LQTS** (Long QT Syndrome), **LVEF** (Left Ventricular Ejection Fraction), **MGH** (Massachusetts General Hospital Dataset), **MHI** (Montreal Heart Institute Dataset), **MIMIC-IV** (Medical Information Mart for Intensive Care IV Dataset), **NYP** (New York-Presbyterian Hospital Dataset), **PTB** (Physikalisch-Technische Bundesanstalt Dataset), **UKB** (UK Biobank Dataset), **UCSF** (University of California San Francisco Medical Center Dataset), UW (University of Washington Medical Center), **CHUM** (Centre hospitalier de l’Université de Montréal), and **WCR** (Wave2Vec + Contrastive Multi-segment Coding + Random Lead Masking).

See this image and copyright information in PMC

References

1. Holst H., Ohlsson M., Peterson C. & Edenbrandt L. A confident decision support system for interpreting electrocardiograms. Clin. Physiol. 19, 410–418 (1999). - PubMed
1. Martínez-Sellés M. & Marina-Breysse M. Current and Future Use of Artificial Intelligence in Electrocardiography. JCDD 10, 175 (2023). - PMC - PubMed
1. Hannun A. Y. et al. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat Med 25, 65–69 (2019). - PMC - PubMed
1. Yao X. et al. Artificial intelligence–enabled electrocardiograms for identification of patients with low ejection fraction: a pragmatic, randomized clinical trial. Nat Med 27, 815–819 (2021). - PubMed
1. Aras M. A. et al. Electrocardiogram Detection of Pulmonary Hypertension Using Deep Learning. J. Card. Fail. 29, 1017–1028 (2023). - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

Foundation models for generalizable electrocardiogram interpretation: comparison of supervised and self-supervised electrocardiogram foundation models

Affiliations

Foundation models for generalizable electrocardiogram interpretation: comparison of supervised and self-supervised electrocardiogram foundation models

Authors

Affiliations

Abstract

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources