Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2022 Nov;37(11):3838-3845.
doi: 10.1111/jocs.16842. Epub 2022 Aug 24.

Machine learning to predict adverse outcomes after cardiac surgery: A systematic review and meta-analysis

Affiliations
Meta-Analysis

Machine learning to predict adverse outcomes after cardiac surgery: A systematic review and meta-analysis

Jahan C Penny-Dimri et al. J Card Surg. 2022 Nov.

Abstract

Background: Machine learning (ML) models are promising tools for predicting adverse postoperative outcomes in cardiac surgery, yet have not translated to routine clinical use. We conducted a systematic review and meta-analysis to assess the predictive performance of ML approaches.

Methods: We conducted an electronic search to find studies assessing ML and traditional statistical models to predict postoperative outcomes. Our primary outcome was the concordance (C-) index of discriminative performance. Using a Bayesian meta-analytic approach we pooled the C-indices with the 95% credible interval (CrI) across multiple outcomes comparing ML methods to logistic regression (LR) and clinical scoring tools. Additionally, we performed critical difference and sensitivity analysis.

Results: We identified 2792 references from the search of which 51 met inclusion criteria. Two postoperative outcomes were amenable for meta-analysis: 30-day mortality and in-hospital mortality. For 30-day mortality, the pooled C-index and 95% CrI were 0.82 (0.79-0.85), 0.80 (0.77-0.84), 0.78 (0.74-0.82) for ML models, LR, and scoring tools respectively. For in-hospital mortality, the pooled C-index was 0.81 (0.78-0.84) and 0.79 (0.73-0.84) for ML models and LR, respectively. There were no statistically significant results indicating ML superiority over LR.

Conclusion: In cardiac surgery patients, for the prediction of mortality, current ML methods do not have greater discriminative power over LR as measured by the C-index.

Keywords: artificial intelligence; cardiac surgery; machine learning; meta-analysis; perioperative risk; systematic review.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Preferred reporting items for systematic reviews and meta‐analysis (PRISMA) flow diagram. The flow diagram shows the flow of studies and exclusions through the different phases of the systematic review.
Figure 2
Figure 2
Distribution of publication rate over time for included studies. The rate of publication of applied machine learning papers in cardiac surgery is currently following an exponential distribution.
Figure 3
Figure 3
Forest plot for 30‐day mortality across model type. Panel (A) shows the subgroup for the best performing machine learning model. Panel (B) shows the subgroup for logistic regression models. Panel (C) shows the subgroup for clinical scoring tools.
Figure 4
Figure 4
Forest plot for in‐hospital mortality across model type. Panel (A) shows the subgroup for the best performing machine learning model. Panel (B) shows the subgroup for logistic regression models.
Figure 5
Figure 5
Critical difference diagram for low‐risk studies. This figure compares the pairwise performance of machine learning (ML) models, logistic regression, and scoring tools and is agnostic to outcome. Groups further to the right are more performant, however, a solid line between groups indicates a lack of statistical significance. In this plot, although there is a trend toward ML models outperforming logistic regression, it is not statistically significant. All other pairwise comparisons are statistically significant.

Comment in

References

    1. Crawford TC, Magruder JT, Grimm JC, et al. Complications after cardiac operations: all are not created equal. Ann Thorac. 2017;103(1):32‐40. 10.1016/j.athoracsur.2016.10.022 - DOI - PubMed
    1. Hote M. Cardiac surgery risk scoring systems: in quest for the best. Heart Asia. 2018;10(1):e011017. 10.1136/heartasia-2018-011017 - DOI - PMC - PubMed
    1. Benedetto U, Dimagli A, Sinha S, et al. Machine learning improves mortality risk prediction after cardiac surgery: systematic review and meta‐analysis. J Thorac Cardiovasc Surg. 2020;163:2075‐2087. 10.1016/j.jtcvs.2020.07.105 - DOI - PubMed
    1. Ripley BD. Pattern Recognition and Neural Networks. Cambridge University Press; 1996. 10.1017/cbo9780511812651 - DOI
    1. Deo RC. Machine learning in medicine. Circulation. 2015;132(20):1920‐1930. 10.1161/circulationaha.115.001593 - DOI - PMC - PubMed