Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 1;6(6):633-641.
doi: 10.1001/jamacardio.2021.0122.

Use of Machine Learning Models to Predict Death After Acute Myocardial Infarction

Affiliations

Use of Machine Learning Models to Predict Death After Acute Myocardial Infarction

Rohan Khera et al. JAMA Cardiol. .

Abstract

Importance: Accurate prediction of adverse outcomes after acute myocardial infarction (AMI) can guide the triage of care services and shared decision-making, and novel methods hold promise for using existing data to generate additional insights.

Objective: To evaluate whether contemporary machine learning methods can facilitate risk prediction by including a larger number of variables and identifying complex relationships between predictors and outcomes.

Design, setting, and participants: This cohort study used the American College of Cardiology Chest Pain-MI Registry to identify all AMI hospitalizations between January 1, 2011, and December 31, 2016. Data analysis was performed from February 1, 2018, to October 22, 2020.

Main outcomes and measures: Three machine learning models were developed and validated to predict in-hospital mortality based on patient comorbidities, medical history, presentation characteristics, and initial laboratory values. Models were developed based on extreme gradient descent boosting (XGBoost, an interpretable model), a neural network, and a meta-classifier model. Their accuracy was compared against the current standard developed using a logistic regression model in a validation sample.

Results: A total of 755 402 patients (mean [SD] age, 65 [13] years; 495 202 [65.5%] male) were identified during the study period. In independent validation, 2 machine learning models, gradient descent boosting and meta-classifier (combination including inputs from gradient descent boosting and a neural network), marginally improved discrimination compared with logistic regression (C statistic, 0.90 for best performing machine learning model vs 0.89 for logistic regression). Nearly perfect calibration in independent validation data was found in the XGBoost (slope of predicted to observed events, 1.01; 95% CI, 0.99-1.04) and the meta-classifier model (slope of predicted-to-observed events, 1.01; 95% CI, 0.99-1.02), with more precise classification across the risk spectrum. The XGBoost model reclassified 32 393 of 121 839 individuals (27%) and the meta-classifier model reclassified 30 836 of 121 839 individuals (25%) deemed at moderate to high risk for death in logistic regression as low risk, which were more consistent with the observed event rates.

Conclusions and relevance: In this cohort study using a large national registry, none of the tested machine learning models were associated with substantive improvement in the discrimination of in-hospital mortality after AMI, limiting their clinical utility. However, compared with logistic regression, XGBoost and meta-classifier models, but not the neural network, offered improved resolution of risk for high-risk individuals.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Spertus reported receiving grants from the American College of Cardiology Foundation during the conduct of the study; in addition, Dr Spertus holds an equity interest in Health Outcomes Sciences. Dr Desai reported receiving grants and personal fees from Amgen, Boehringer Ingelheim, and Cytokinetics and personal fees from Relypsa, Novartis, Medicines Company, and SC Pharmaceuticals outside the submitted work. Dr Masoudi reported being chief scientific adviser and chair of the management board of the American College of Cardiology National Cardiovascular Data Registry outside the submitted work. Dr Normand reported having a pending patent (201810345624.5). Dr Mortazavi reported receiving grants from the National Institute of Biomedical Imaging and Bioengineering, National Heart, Lung, and Blood Institute, US Food and Drug Administration, and the US Department of Defense Advanced Research Projects Agency outside the submitted work; in addition, Dr Mortazavi has a pending patent (US20180315507A1). Dr Krumholz reported receiving personal fees from UnitedHealth, IBM Watson Health, Element Science, Aetna, Facebook, the Siegfried and Jensen Law Firm, the Arnold and Porter Law Firm, the Martin/Baughman Law Firm, and the National Center for Cardiovascular Diseases, Beijing; being a founder of HugoHealth and Refactor Health; and receiving grants and/or contracts from the Centers for Medicare & Medicaid Services, Medtronic, the US Food and Drug Administration, Johnson & Johnson, and the Shenzhen Center for Health Information outside the submitted work. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Design of Machine Learning Algorithms
The level 1 classifiers consist of 3 independent models each trained on the same initial training sample (sample A), including logistic regression with least absolute shrinkage and selection operator (LASSO), extreme gradient descent boosting (XGBoost), and a neural network. The next training sample (sample B) is then input into the level 1 classifiers, resulting in 3 risk estimates for each observation in sample B, 1 from each level 1 model. These 3 risk estimates are then used to train the level 2 XGBoost classifier (sample C). A final sample (sample D) is input into the level 1 classifiers to obtain risk estimates for input into the level 2 classifier. Performance of the level 1 and level 2 classifiers is assessed using this final training set D.
Figure 2.
Figure 2.. Predicted Risk of In-Hospital Mortality by Machine Learning and Logistic Regression Models
Extreme gradient boosting model (XGBoost) (A), neural network (B), and meta-classifier model (C), using the 29-variable input used in the development of the model by McNamara et al. The shaded areas denote standard error of the calibration.

Comment in

References

    1. Fox KA, Dabbous OH, Goldberg RJ, et al. . Prediction of risk of death and myocardial infarction in the six months after presentation with acute coronary syndrome: prospective multinational observational study (GRACE). BMJ. 2006;333(7578):1091. doi:10.1136/bmj.38985.646481.55 - DOI - PMC - PubMed
    1. Granger CB, Goldberg RJ, Dabbous O, et al. ; Global Registry of Acute Coronary Events Investigators . Predictors of hospital mortality in the global registry of acute coronary events. Arch Intern Med. 2003;163(19):2345-2353. doi:10.1001/archinte.163.19.2345 - DOI - PubMed
    1. Antman EM, Cohen M, Bernink PJ, et al. . The TIMI risk score for unstable angina/non-ST elevation MI: a method for prognostication and therapeutic decision making. JAMA. 2000;284(7):835-842. doi:10.1001/jama.284.7.835 - DOI - PubMed
    1. Fox KA, Poole-Wilson P, Clayton TC, et al. . 5-year outcome of an interventional strategy in non-ST-elevation acute coronary syndrome: the British Heart Foundation RITA 3 randomised trial. Lancet. 2005;366(9489):914-920. doi:10.1016/S0140-6736(05)67222-4 - DOI - PubMed
    1. Souza AD, Migon HS. Bayesian binary regression model: an application to in-hospital death after AMI prediction. Pesquisa Operacional. 2004;24(2):253-267. doi:10.1590/S0101-74382004000200003 - DOI

Publication types