Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models
- PMID: 11246308
- DOI: 10.1097/00003246-200102000-00012
Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models
Abstract
Objective: Logistic regression (LR), commonly used for hospital mortality prediction, has limitations. Artificial neural networks (ANNs) have been proposed as an alternative. We compared the performance of these approaches by using stepwise reductions in sample size.
Design: Prospective cohort study.
Setting: Seven intensive care units (ICU) at one tertiary care center.
Patients: Patients were 1,647 ICU admissions for whom first-day Acute Physiology and Chronic Health Evaluation III variables were collected.
Interventions: None.
Measurements and main results: We constructed LR and ANN models on a random set of 1,200 admissions (development set) and used the remaining 447 as the validation set. We repeated model construction on progressively smaller development sets (800, 400, and 200 admissions) and retested on the original validation set (n = 447). For each development set, we constructed models from two LR and two ANN architectures, organizing the independent variables differently. With the 1,200-admission development set, all models had good fit and discrimination on the validation set, where fit was assessed by the Hosmer-Lemeshow C statistic (range, 10.6-15.3; p > or = .05) and standardized mortality ratio (SMR) (range, 0.93 [95% confidence interval, 0.79-1.15] to 1.09 [95% confidence interval, 0.89-1.38]), and discrimination was assessed by the area under the receiver operating characteristic curve (range, 0.80-0.84). As development set sample size decreased, model performance on the validation set deteriorated rapidly, although the ANNs retained marginally better fit at 800 (best C statistic was 26.3 [p = .0009] and 13.1 [p = .11] for the LR and ANN models). Below 800, fit was poor with both approaches, with high C statistics (ranging from 22.8 [p <.004] to 633 [p <.0001]) and highly biased SMRs (seven of the eight models below 800 had SMRs of <0.85, with an upper confidence interval of <1). Discrimination ranged from 0.74 to 0.84 below 800.
Conclusions: When sample size is adequate, LR and ANN models have similar performance. However, development sets of < or = 800 were generally inadequate. This is concerning, given typical sample sizes used for individual ICU mortality prediction.
Comment in
-
Some remarks on choosing a method for outcome prediction.Crit Care Med. 2002 Mar;30(3):724. doi: 10.1097/00003246-200203000-00050. Crit Care Med. 2002. PMID: 11990953 No abstract available.
Similar articles
-
Acute Physiology and Chronic Health Evaluation (APACHE) IV: hospital mortality assessment for today's critically ill patients.Crit Care Med. 2006 May;34(5):1297-310. doi: 10.1097/01.CCM.0000215112.84523.F0. Crit Care Med. 2006. PMID: 16540951
-
Evaluation of acute physiology and chronic health evaluation III predictions of hospital mortality in an independent database.Crit Care Med. 1998 Aug;26(8):1317-26. doi: 10.1097/00003246-199808000-00012. Crit Care Med. 1998. PMID: 9710088
-
Prediction of mortality in an Indian intensive care unit. Comparison between APACHE II and artificial neural networks.Intensive Care Med. 2004 Feb;30(2):248-253. doi: 10.1007/s00134-003-2105-4. Epub 2004 Jan 15. Intensive Care Med. 2004. PMID: 14727015
-
[Methodological approach to the use of artificial neural networks for predicting results in medicine].Med Clin (Barc). 2004;122 Suppl 1:59-67. doi: 10.1157/13057536. Med Clin (Barc). 2004. PMID: 14980162 Review. Spanish.
-
[Approach to the methodology of classification and regression trees].Gac Sanit. 2008 Jan-Feb;22(1):65-72. doi: 10.1157/13115113. Gac Sanit. 2008. PMID: 18261446 Review. Spanish.
Cited by
-
In Silico Modeling in Infectious Disease.Drug Discov Today Dis Models. 2007 Oct 1;4(3):117-122. doi: 10.1016/j.ddmod.2007.09.001. Drug Discov Today Dis Models. 2007. PMID: 19707291 Free PMC article.
-
Key Experimental Factors of Machine Learning-Based Identification of Surgery Cancellations.J Healthc Eng. 2021 Feb 20;2021:6247652. doi: 10.1155/2021/6247652. eCollection 2021. J Healthc Eng. 2021. PMID: 33688420 Free PMC article.
-
Using data-driven rules to predict mortality in severe community acquired pneumonia.PLoS One. 2014 Apr 3;9(4):e89053. doi: 10.1371/journal.pone.0089053. eCollection 2014. PLoS One. 2014. PMID: 24699007 Free PMC article.
-
DeepSOFA: A Continuous Acuity Score for Critically Ill Patients using Clinically Interpretable Deep Learning.Sci Rep. 2019 Feb 12;9(1):1879. doi: 10.1038/s41598-019-38491-0. Sci Rep. 2019. PMID: 30755689 Free PMC article.
-
An attention based deep learning model of clinical events in the intensive care unit.PLoS One. 2019 Feb 13;14(2):e0211057. doi: 10.1371/journal.pone.0211057. eCollection 2019. PLoS One. 2019. PMID: 30759094 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials