Sci Rep. 2025 Sep 24;15(1):32744.

doi: 10.1038/s41598-025-14893-1.

A machine learning approach to identify patients at risk for long-term consequences after pulmonary embolism

Stephan Nopp^#¹, Clemens Spielvogel^#², Behnood Bikdeli^{3,4,5}, Ana Alberich-Conesa⁶, Luis Hernández-Blasco^{7,8}, Mª Luisa Peris^{9,10}, Remedios Otero^{11,12,13}, David Jiménez^{13,14,15}, Manuel Monreal^{13,16}, Cihan Ay¹⁷; RIETE Investigators

Collaborators

RIETE Investigators:
P Agudo de Blas, J Aibar, A Alberich-Conesa, A Alda-Lozano, J Alfonso, J Alonso-Carrillo, P Álvarez-Vega, C Amado, M Angelina-García, J I Arcelus, A Ballaz, R Barba, C Barbagelata, M Barrón, B Barrón-Andrés, J Bascuñana, Á Blanco-Molina, J Bustos-Carpio, I Casado, L Chasco, G Claver, C De Juana-Izquierdo, J Del Toro, P Demelo-Rodríguez, A M Díaz-Brasero, M C Díaz-Pedroche, J A Díaz-Peromingo, Á Dubois-Silva, J C Escribano, C Fernández-Capitán, B Fernández-Jiménez, J L Fernández-Reyes, M A Fidalgo, I Francisco, C Gabara, F Galeano-Valle, F García-Bragado, C García-González, A García-Ortega, O Gavín-Sebastián, M A Gil de Gómez, A Gil-Díaz, C Gómez-Cepeda, C Gómez-Cuervo, J González-Martínez, A González-Munera, J Gorostidi, E Grau, L Guirado, J Gutiérrez-Guisado, L Hernández-Blasco, L Jara-Palomares, D Jiménez, I Jou, M D Joya, S Láinez-Justo, R Lecumberri, J L Lobo, L López-Jiménez, P López-Miguel, J J López-Núñez, A López-Ruiz, J B López-Sáez, A Lorenzo, O Madridano, A Maestre, P J Marchena, M Marcos, M Martín-Del Pozo, F Martín-Martos, J M Maza, M I Mercado, A Molino, M Monreal, L Monzón, M S Navas, J A Nieto, M J Núñez-Fernández, L Ordieres, O Ortiz, S Otálora, R Otero, N Pacheco-Gómez, J Pagán, P Parra-Caballero, J M Pedrajas, J Pérez-Amorós, A Pérez-Cabezas, C Pérez-Ductor, M Pérez-Pinar, M L Peris, M L Pesce, J A Porras, R Puchades, G Puche, A Rivas, F Rivera-Cívico, A Rodríguez-Cobo, M Romero-Bruguera, G Salgueiro, A Sánchez-Juez, T Sancho, V Sendín, P Sigüenza, S Soler, L A Sota-Yoldi, S Suárez-Fernández, R Tirado, A Torrents-Vilar, M I Torres, J Trujillo-Santos, F Uresandi, R Valle, J F Varona, E Vázquez, A Villalobos, C Villarejo, P Villares, C Ay, S Nopp, I Pabinger, T Vanassche, P Verhamme, A Verstraete, A T Rocha, H H B Yoo, A C Montenegro, S N Morales, J Roa, J Hirmerova, R Malý, S Acassat, L Bertoletti, M Brehon, A Bura-Riviere, J Catella, R Chopard, F Couturaud, O Espitia, R Le Mao, B Leclercq, I Mahé, F Moustafa, L Plaisance, G Poenou, I Quéré, G Sarlon-Bartoli, P Suchon, E Versini, S Schellong, F Rashidi, P Sadeghipour, F Tahmasbi, B Brenner, G Kennet, I Tzoran, G Barillari, M Basaglia, F Bilora, D Bissacco, B Brandolin, R Casana, M M Ciammaichella, D Colaizzo, P Di Micco, M Giorgi-Pierfranceschi, E Grandone, D Lambertenghi-Deliliers, C Marcon, A Poz, P Prandoni, P Simioni, C Siniscalchi, B Taflaj, A Tufano, A Visonà, B Zalunardo, A Skride, Z Tazi-Mezalek, S Fonseca, R Marques, J Meireles, S Pinto, M Bosevski, M Zdraveska, S Barco, H Bounameaux, S Keller, L Mazzolai, E Porceddu, A Aujayeb, D J Angiolillo, B Bikdeli, J A Caprini, A Khalil, L Ortega-Paz, J Tafur, I Weinberg, H M Bui

Affiliations

¹ Clinical Division of Haematology and Haemostaseology, Department of Medicine I, Medical University of Vienna, Waehringer Guertel 18-20, 1090, Vienna, Austria.
² Division of Nuclear Medicine, Department of Biomedical Imaging and Image-Guided Therapy, Medical University of Vienna, Vienna, Austria.
³ Cardiovascular Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
⁴ Thrombosis Research Group, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
⁵ YNHH/Yale Center for Outcomes Research and Evaluation (CORE), New Haven, CT, USA.
⁶ Department of Internal Medicine, Hospital Universitari de Girona Dr. Josep Trueta, Gerona, Spain.
⁷ Department of Pulmonology, Dr. Balmis General University Hospital, ISABIAL, Alicante, Spain.
⁸ Department of Clinical Medicine, UMH. Alicante, Sant Joan d'Alacant, Spain.
⁹ Department of Internal Medicine, Consorcio Hospitalario Provincial de Castellón, Castellón de La Plana, Castellón, Spain.
¹⁰ Universidad Cardenal Herrera-CEU, CEU Universities, Valencia, Spain.
¹¹ Medical Surgical Unit of Respiratory Diseases, Hospital Universitario Virgen del Rocío, Seville, Spain.
¹² Consejo Superior de Investigaciones Cientificas (CSIC), Instituto de Biomedicina de Sevilla (IBiS), Universidad de Sevilla, Seville, Spain.
¹³ CIBER de Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Madrid, Spain.
¹⁴ Respiratory Department, Hospital Ramón y Cajal (IRYCIS), Madrid, Spain.
¹⁵ Medicine Department, Universidad de Alcalá, Madrid, Spain.
¹⁶ Director of the Chair for Thromboembolic Disorders, Universidad Católica de Murcia, Murcia, Spain.
¹⁷ Clinical Division of Haematology and Haemostaseology, Department of Medicine I, Medical University of Vienna, Waehringer Guertel 18-20, 1090, Vienna, Austria. cihan.ay@meduniwien.ac.at.

^# Contributed equally.

PMID: 40993195
PMCID: PMC12460599
DOI: 10.1038/s41598-025-14893-1

Stephan Nopp et al. Sci Rep. 2025.

Abstract

Pulmonary embolism (PE) can result in long-term sequelae, such as post-PE syndrome, including persistent dyspnea and chronic thromboembolic pulmonary hypertension (CTEPH). Existing prediction tools for severe post-PE complications lack sensitivity and specificity. This study aimed to develop a machine learning model to identify patients at risk for long-term consequences after PE. Using data from the RIETE registry, the largest prospective international PE registry, we developed supervised machine learning models to identify patients at increased risk of CTEPH and post-PE syndrome. Our approach involved data preprocessing, model training with a random forest algorithm, and validation by Monte-Carlo cross-validation. The performance of the CTEPH prediction model was benchmarked against an existing score. Of the 57,981 PE patients in the RIETE registry, 5,217 were eligible for inclusion. Median age was 68 years, and 50.6% were men. The models were trained on 111 predictor variables. CTEPH developed in 171 patients (3.3%). The CTEPH model demonstrated good performance with an AUC of 0.74 (95% CI: 0.73-0.75), significantly outperforming the existing CTEPH prediction score (AUC 0.57; 95% CI: 0.54-0.61). Additionally, 1,310 (25.1%) patients were defined as having post-PE syndrome six months after the index PE. The post-PE syndrome model showed poorer performance with an AUC of 0.62 (95% CI: 0.61-0.62). Key predictor variables across both models included chest pain at presentation, PE location, troponin, side of clot, and dyspnea at presentation. Machine learning models show promise in predicting CTEPH but are less effective for post-PE syndrome. Future refinement, including integrating imaging data, is necessary to improve predictive performance and clinical utility.
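
The cohort figures in the abstract can be cross-checked with simple arithmetic. The sketch below recomputes the reported percentages from the reported counts and adds two derived quantities that are not stated in the abstract (the share of registry patients who were eligible, and the accuracy of a trivial rule that never predicts CTEPH); the variable and function names are illustrative only.

```python
# Counts as reported in the abstract.
registry_total = 57_981   # PE patients in the RIETE registry
eligible = 5_217          # patients eligible for inclusion
cteph_cases = 171         # patients who developed CTEPH
post_pe_cases = 1_310     # patients defined as having post-PE syndrome


def pct(part, whole):
    """Percentage of `whole` made up by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


cteph_pct = pct(cteph_cases, eligible)        # reported as 3.3%
post_pe_pct = pct(post_pe_cases, eligible)    # reported as 25.1%

# Derived here, not reported in the abstract:
eligible_pct = pct(eligible, registry_total)
# Accuracy of a rule that labels every patient "no CTEPH".
trivial_accuracy = pct(eligible - cteph_cases, eligible)

print(f"eligible: {eligible_pct}% of registry")
print(f"CTEPH: {cteph_pct}%, post-PE syndrome: {post_pe_pct}%")
print(f"'never CTEPH' accuracy: {trivial_accuracy}%")
```

With an outcome this rare, plain accuracy says little about a CTEPH model, which is one reason a ranking measure such as the AUC is the headline metric.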

Keywords: Dyspnea; Machine learning; Prediction; Pulmonary arterial hypertension; Pulmonary embolism; Venous thromboembolism.

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

**Fig. 2**
Machine learning workflow. The workflow consists of three major steps: preprocessing, model training, and model validation. Data were split into cross-validation folds before fold-wise preprocessing, training of a random forest classifier, and performance assessment; the performance results were then accumulated and averaged.
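
The workflow in this caption (repeated random splits, preprocessing fitted inside each fold, a classifier trained per fold, results averaged) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the data are synthetic, the split is stratified, the preprocessing is median imputation, and a nearest-centroid scorer stands in for the random forest to keep the example dependency-free. The values of `n_repeats` and `test_fraction` are arbitrary.

```python
import random
import statistics


def stratified_split(labels, test_fraction, rng):
    """Randomly split indices into train/test, keeping class proportions."""
    train, test = [], []
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        n_test = max(1, round(test_fraction * len(idx)))
        test += idx[:n_test]
        train += idx[n_test:]
    return train, test


def fit_imputer(rows):
    """Learn per-column medians from the given (training) rows only."""
    return [statistics.median(r[j] for r in rows if r[j] is not None)
            for j in range(len(rows[0]))]


def impute(rows, medians):
    """Replace missing values (None) with the supplied medians."""
    return [[medians[j] if v is None else v for j, v in enumerate(r)]
            for r in rows]


def fit_centroids(rows, labels):
    """Stand-in model (NOT a random forest): mean feature vector per class."""
    cents = {}
    for cls in (0, 1):
        members = [r for r, y in zip(rows, labels) if y == cls]
        cents[cls] = [statistics.fmean(col) for col in zip(*members)]
    return cents


def score(row, cents):
    """Higher score means the row lies closer to the positive centroid."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(row, c))
    return sq_dist(cents[0]) - sq_dist(cents[1])


def auc(scores, labels):
    """Probability that a random case scores above a random non-case."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def monte_carlo_cv(X, y, n_repeats=50, test_fraction=0.2, seed=0):
    """Repeat random train/test splits and return one AUC per repetition."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(n_repeats):
        train, test = stratified_split(y, test_fraction, rng)
        medians = fit_imputer([X[i] for i in train])    # fitted on train only
        X_tr = impute([X[i] for i in train], medians)
        X_te = impute([X[i] for i in test], medians)    # reuses train medians
        cents = fit_centroids(X_tr, [y[i] for i in train])
        scores = [score(r, cents) for r in X_te]
        aucs.append(auc(scores, [y[i] for i in test]))
    return aucs


# Synthetic data: 300 rows, 3 features, about 20% positives, 10% missing.
data_rng = random.Random(42)
y = [1 if data_rng.random() < 0.2 else 0 for _ in range(300)]
X = [[None if data_rng.random() < 0.1 else data_rng.gauss(float(label), 1.0)
      for _ in range(3)] for label in y]

aucs = monte_carlo_cv(X, y)
mean_auc = statistics.fmean(aucs)
print(f"mean AUC over {len(aucs)} repetitions: {mean_auc:.2f}")
```

Fitting the imputer on the training portion of each split and reusing its medians on the held-out portion is the point of fold-wise preprocessing: no information from the test rows reaches the model before evaluation.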

**Fig. 3**
Machine learning performance for CTEPH and relative importance of model parameters. (A) Performance metrics for the prediction of CTEPH. Error bars indicate the 95% confidence intervals. (B) Relative feature importance of the ten most predictive parameters in the CTEPH prediction model. Abbreviations: ACC, accuracy; AUC, area under the curve; CTEPH, chronic thromboembolic pulmonary hypertension; NPV, negative predictive value; PPV, positive predictive value; SNS, sensitivity; SPC, specificity.
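
The threshold-dependent metrics abbreviated in this caption all derive from the four cells of a confusion matrix. The sketch below uses their standard definitions on a made-up set of ten labels and predictions (not study data); it assumes both classes occur and that at least one positive and one negative prediction is made, so no denominator is zero.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, TN, FP, FN) for binary labels coded as 0/1."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return tp, tn, fp, fn


def classification_metrics(y_true, y_pred):
    """Standard definitions of the metrics named in the caption."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return {
        "ACC": (tp + tn) / (tp + tn + fp + fn),  # accuracy
        "SNS": tp / (tp + fn),                   # sensitivity
        "SPC": tn / (tn + fp),                   # specificity
        "PPV": tp / (tp + fp),                   # positive predictive value
        "NPV": tn / (tn + fn),                   # negative predictive value
    }


# Made-up example: 3 true cases and 7 non-cases.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]

metrics = classification_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Unlike these five metrics, the AUC does not depend on a classification threshold, since it summarizes how well the model ranks cases above non-cases.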

**Fig. 4**
Machine learning performance for post-PE syndrome and relative importance of model parameters. (A) Performance metrics for the prediction of post-PE syndrome. Error bars indicate the 95% confidence intervals. (B) Relative feature importance of the ten most predictive parameters in the post-PE prediction model. Abbreviations: ACC, accuracy; AUC, area under the curve; NPV, negative predictive value; PPV, positive predictive value; SNS, sensitivity; SPC, specificity.
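
One common way to obtain error bars like those described here is to take the mean of a metric over the cross-validation repetitions and attach a normal-approximation 95% interval. Whether the authors computed their intervals this way is not stated on this page, so the sketch below is an assumption; the AUC values in it are invented for illustration and are not study results.

```python
import math
import statistics


def mean_ci95(values):
    """Mean with a normal-approximation 95% confidence interval."""
    mean = statistics.fmean(values)
    # Standard error of the mean from the sample standard deviation.
    se = statistics.stdev(values) / math.sqrt(len(values))
    half_width = 1.96 * se
    return mean, mean - half_width, mean + half_width


# Invented per-repetition AUCs (hypothetical, not from the study).
aucs = [0.78, 0.81, 0.79, 0.82, 0.80, 0.80, 0.79, 0.81]

mean, lower, upper = mean_ci95(aucs)
print(f"AUC {mean:.2f} (95% CI: {lower:.3f}-{upper:.3f})")
```

Because Monte-Carlo repetitions reuse overlapping subsets of the same patients, the per-repetition values are not independent, and an interval computed this way can come out narrower than the true uncertainty.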

Grants and funding

GTH Early Career Research Grant 2023/Gesellschaft für Thrombose- und Hämostaseforschung
