. 2021 Nov 25;3(1):56-66.

doi: 10.1093/ehjdh/ztab101. eCollection 2022 Mar.

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Sulaiman S Somani¹, Hossein Honarvar¹, Sukrit Narula², Isotta Landi¹, Shawn Lee³, Yeraz Khachatoorian⁴, Arsalan Rehmani³, Andrew Kim⁴, Jessica K De Freitas¹, Shelly Teng¹, Suraj Jaladanki¹, Arvind Kumar¹, Adam Russak¹, Shan P Zhao¹, Robert Freeman⁵, Matthew A Levin⁶, Girish N Nadkarni¹, Alexander C Kagen⁷, Edgar Argulian³, Benjamin S Glicksberg¹

Affiliations

¹ The Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, 770 Lexington Ave, 15th Fl, New York, NY, 10065, USA.
² Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, 20 Copeland Ave, Hamilton, ON L8L 2X2, Canada.
³ Department of Cardiology, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁴ Department of Internal Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁵ Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁶ Department of Anesthesiology, Perioperative, and Pain Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁷ Department of Radiology, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.

PMID: 35355847
PMCID: PMC8946569
DOI: 10.1093/ehjdh/ztab101

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Sulaiman S Somani et al. Eur Heart J Digit Health. 2021.

. 2021 Nov 25;3(1):56-66.

doi: 10.1093/ehjdh/ztab101. eCollection 2022 Mar.

Authors

Affiliations

¹ The Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, 770 Lexington Ave, 15th Fl, New York, NY, 10065, USA.
² Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, 20 Copeland Ave, Hamilton, ON L8L 2X2, Canada.
³ Department of Cardiology, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁴ Department of Internal Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁵ Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁶ Department of Anesthesiology, Perioperative, and Pain Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.
⁷ Department of Radiology, Icahn School of Medicine at Mount Sinai, 1 Gustave L Levy Pl, New York, NY 10029, USA.

PMID: 35355847
PMCID: PMC8946569
DOI: 10.1093/ehjdh/ztab101

Abstract

Aims: Clinical scoring systems for pulmonary embolism (PE) screening have low specificity and contribute to computed tomography pulmonary angiogram (CTPA) overuse. We assessed whether deep learning models using an existing and routinely collected data modality, electrocardiogram (ECG) waveforms, can increase specificity for PE detection.

Methods and results: We create a retrospective cohort of 21 183 patients at moderate- to high suspicion of PE and associate 23 793 CTPAs (10.0% PE-positive) with 320 746 ECGs and encounter-level clinical data (demographics, comorbidities, vital signs, and labs). We develop three machine learning models to predict PE likelihood: an ECG model using only ECG waveform data, an EHR model using tabular clinical data, and a Fusion model integrating clinical data and an embedded representation of the ECG waveform. We find that a Fusion model [area under the receiver-operating characteristic curve (AUROC) 0.81 ± 0.01] outperforms both the ECG model (AUROC 0.59 ± 0.01) and EHR model (AUROC 0.65 ± 0.01). On a sample of 100 patients from the test set, the Fusion model also achieves greater specificity (0.18) and performance (AUROC 0.84 ± 0.01) than four commonly evaluated clinical scores: Wells' Criteria, Revised Geneva Score, Pulmonary Embolism Rule-Out Criteria, and 4-Level Pulmonary Embolism Clinical Probability Score (AUROC 0.50-0.58, specificity 0.00-0.05). The model is superior to these scores on feature sensitivity analyses (AUROC 0.66-0.84) and achieves comparable performance across sex (AUROC 0.81) and racial/ethnic (AUROC 0.77-0.84) subgroups.

Conclusion: Synergistic deep learning of ECG waveforms with traditional clinical variables can increase the specificity of PE detection in patients at least at moderate suspicion for PE.

Keywords: Deep learning; Electrocardiogram; Machine learning; Pulmonary embolism.

PubMed Disclaimer

Figures

**Figure 1**
Study design. (A) Our pipeline for creating models to detect pulmonary embolism consists of using three data modalities: electrocardiograms, clinical data [electronic health records (EHR)] including patient demographics, comorbidities, vital signs, and relevant labs, and computed tomography pulmonary angiograms that are labelled using a two-stage approach combining natural language processing pattern matching and manual clinician annotations. These data are linked together to develop, analyse, and benchmark models to predict pulmonary embolism. (B) We split our dataset for training, validation, and testing first by first identifying all unique patients (not unique computed tomography pulmonary angiogram or unique electrocardiogram) and separating them based on whether they have at least one PE-positive computed tomography pulmonary angiogram scan (PE+) or not (PE−). This stratum is further split into 90% for nine-fold cross-validation (89% for training, 11% for model selection and model development) and 10% for testing to assess model performance and benchmark against clinical scores. (C) Electrocardiograms are labelled as PE+ if they are recorded within 24 h of a PE+ computed tomography pulmonary angiogram. Electrocardiograms recorded 24 h after or between 6 months and 24 h before a positive computed tomography pulmonary angiogram are discarded. Electrocardiograms not meeting the above criteria for PE+ computed tomography pulmonary angiograms are labelled PE−. EHR data are retained if collected within 24 h of the computed tomography pulmonary angiogram and labelled equally with the computed tomography pulmonary angiogram finding.

**Figure 2**
Modelling overview, performance, and interpretability. (A) The electrocardiogram model, which is a convolutional neural network with residual connections, trains and infers pulmonary embolism likelihood using 10-s long waveform from 8 leads (I, II, V1–V6) recorded at 500 Hz. The EHR model is an Extreme Gradient Boosting (XGBoost) model that uses tabular clinical data (demographics, comorbidities, labs, and vital signs) and electrocardiogram morphology parameters to predict the likelihood of pulmonary embolism. Finally, the fusion model is an XGBoost model that uses a principal component decomposition of an electrocardiogram waveform embedding from the electrocardiogram model, tabular clinical data, and electrocardiogram morphology parameters in an XGBoost framework to predict the likelihood of pulmonary embolism. (B) Mean receiver-operating characteristic (top) and precision-recall (bottom) curves with 95% confidence intervals for the electrocardiogram (red), EHR (blue), and Fusion (orange) models, with the mean and standard deviations for the area under each respective curve (AUROC, AUPRC) in the figure legend. In top plot, the horizontal and vertical lines correspond to optimal threshold. The Fusion model outperforms both the electrocardiogram and EHR models. (C) SHAP dependency plots for the EHR model (top) and Fusion model (bottom), representing the marginal contribution from patient encounters in the test set (dots, coloured by value of feature) of different features (y-axis, in descending order of importance) on the model output (x-axis, positive favours increased pulmonary embolism likelihood). Grey dots represent samples with missing data points.

**Figure 3**
Clinical benchmark and integration. (A) Mean receiver-operating characteristic (ROC, left) and precision-recall (PRC, right) curves with 95% confidence intervals for the Fusion model with (pink) and without (brown) D-dimer, whereas ROC and PRC are shown for the clinical scores—Wells’ Criteria (yellow), Revise Geneva Score (green), PERC (red), and 4PEPS (purple). In top plot, the horizontal and vertical lines correspond to optimal threshold. Mean and standard deviations for the area under each respective curve (AUROC, AUPRC) for the Fusion models are displayed in the legend, whereas area under each respective curve (AUROC, AUPRC) are shown for the clinical scores. (B) The Fusion model may be used to recommend computed tomography pulmonary angiogram or exclude pulmonary embolism in patients with moderate to high likelihood of pulmonary embolism after clinical stratification or those at low suspicion with an abnormal D-dimer.

See this image and copyright information in PMC

Cited by

A novel multimodal computer-aided diagnostic model for pulmonary embolism based on hybrid transformer-CNN and tabular transformer.
Zhang W, Gu Y, Ma H, Yang L, Zhang B, Wang J, Chen M, Lu X, Li J, Liu X, Yu D, Zhao Y, Tang S, He Q. Zhang W, et al. Phys Eng Sci Med. 2025 May 24. doi: 10.1007/s13246-025-01568-4. Online ahead of print. Phys Eng Sci Med. 2025. PMID: 40411540
Screening for RV Dysfunction Using Smartphone ECG Analysis App: Validation Study with Acute Pulmonary Embolism Patients.
Choi YJ, Park MJ, Cho Y, Kim J, Lee E, Son D, Kim SY, Soh MS. Choi YJ, et al. J Clin Med. 2024 Aug 14;13(16):4792. doi: 10.3390/jcm13164792. J Clin Med. 2024. PMID: 39200934 Free PMC article.
Electrocardiogram Signal Analysis With a Machine Learning Model Predicts the Presence of Pulmonary Embolism With Accuracy Dependent on Embolism Burden.
Wysokinski WE, Meverden RA, Lopez-Jimenez F, Harmon DM, Medina Inojosa BJ, Suarez AB, Liu K, Medina Inojosa JR, Casanegra AI, McBane RD, Houghton DE. Wysokinski WE, et al. Mayo Clin Proc Digit Health. 2024 May 24;2(3):453-462. doi: 10.1016/j.mcpdig.2024.03.009. eCollection 2024 Sep. Mayo Clin Proc Digit Health. 2024. PMID: 40206108 Free PMC article.
Research progress of artificial intelligence and machine learning in pulmonary embolism.
Li Y, Zhang L, Liu H, Li Y, Liu Z. Li Y, et al. Front Med (Lausanne). 2025 Mar 27;12:1577559. doi: 10.3389/fmed.2025.1577559. eCollection 2025. Front Med (Lausanne). 2025. PMID: 40212275 Free PMC article. Review.
Risk stratification of chest pain in the emergency department using artificial intelligence applied to electrocardiograms.
Haimovich JS, Kolossváry M, Alam R, Padrós-Valls R, Lu MT, Aguirre AD. Haimovich JS, et al. Open Heart. 2025 Sep 1;12(2):e003343. doi: 10.1136/openhrt-2025-003343. Open Heart. 2025. PMID: 40889954 Free PMC article.

See all "Cited by" articles

References

1. Smith SB, Geske JB, Kathuria P, et al. Analysis of national trends in admissions for pulmonary embolism. Chest 2016;150:35–45. - PMC - PubMed
1. Huisman MV, Barco S, Cannegieter SC, et al. Pulmonary embolism. Nat Rev Dis Primers 2018;4:18028. - PubMed
1. Konstantinides SV, Meyer G, Becattini C, et al.; ESC Scientific Document Group. 2019 ESC Guidelines for the diagnosis and management of acute pulmonary embolism developed in collaboration with the European Respiratory Society (ERS). Eur Heart J 2020;41:543–603. - PubMed
1. Kline Jeffrey A, Garrett John S, Sarmiento Elisa J, Strachan Christian C, Mark CD. Over-testing for suspected pulmonary embolism in American Emergency Departments. Circ Cardiovasc Qual Outcomes 2020;13:e005753. - PubMed
1. Stacul F, Molen A. V D, Reimer P, et al.; Contrast Media Safety Committee of European Society of Urogenital Radiology (ESUR). Contrast induced nephropathy: updated ESUR Contrast Media Safety Committee guidelines. Eur Radiol 2011;21:2527–2541. - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Affiliations

Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources