Updating mortality risk estimation in intensive care units from high-dimensional electronic health records with incomplete data
- PMID: 37648995
- PMCID: PMC10466694
- DOI: 10.1186/s12911-023-02264-7
Updating mortality risk estimation in intensive care units from high-dimensional electronic health records with incomplete data
Abstract
Background: The risk of mortality in intensive care units (ICUs) is currently addressed by the implementation of scores using admission data. Their performances are satisfactory when complications occur early after admission; however, they may become irrelevant in the case of long hospital stays. In this study, we developed predictive models of short-term mortality in the ICU from longitudinal data.
Methods: Using data collected throughout patients' stays of at least 48 h from the MIMIC-III database, several statistical learning approaches were compared, including deep neural networks and penalized regression. Missing data were handled using complete-case analysis or multiple imputation.
Results: Complete-case analyses from 19 predictors showed good discrimination (AUC > 0.77 for several approaches) to predict death between 12 and 24 h onward, yet excluded 75% of patients from the initial target cohort, as data was missing for some of the predictors. Multiple imputation allowed us to include 70 predictors and keep 95% of patients, with similar performances.
Conclusion: This proof-of-concept study supports that automated analysis of electronic health records can be of great interest throughout patients' stays as a surveillance tool. Although this framework relies on a large set of predictors, it is robust to data imputation and may be effective early after admission, when data are still scarce.
Keywords: Clinical decision support systems; Electronic health records; Machine learning; Multiple imputation; Neural network.
© 2023. BioMed Central Ltd., part of Springer Nature.
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records.Lancet Digit Health. 2020 Apr;2(4):e179-e191. doi: 10.1016/S2589-7500(20)30018-2. Epub 2020 Mar 12. Lancet Digit Health. 2020. PMID: 33328078
-
Survival prediction in intensive-care units based on aggregation of long-term disease history and acute physiology: a retrospective study of the Danish National Patient Registry and electronic patient records.Lancet Digit Health. 2019 Jun;1(2):e78-e89. doi: 10.1016/S2589-7500(19)30024-X. Epub 2019 May 23. Lancet Digit Health. 2019. PMID: 33323232
-
Variation in model performance by data cleanliness and classification methods in the prediction of 30-day ICU mortality, a US nationwide retrospective cohort and simulation study.BMJ Open. 2020 Dec 2;10(12):e041421. doi: 10.1136/bmjopen-2020-041421. BMJ Open. 2020. PMID: 33268427 Free PMC article.
-
Early hospital mortality prediction of intensive care unit patients using an ensemble learning approach.Int J Med Inform. 2017 Dec;108:185-195. doi: 10.1016/j.ijmedinf.2017.10.002. Epub 2017 Oct 5. Int J Med Inform. 2017. PMID: 29132626
-
Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.Crit Care. 2019 Aug 22;23(1):284. doi: 10.1186/s13054-019-2564-9. Crit Care. 2019. PMID: 31439010 Free PMC article.
References
-
- Vincent JL, Moreno R, Takala J, et al. The SOFA (Sepsis-related Organ failure Assessment) score to describe organ dysfunction/failure. On behalf of the Working Group on Sepsis-Related problems of the European Society of Intensive Care Medicine. Intensive Care Med. 1996;22:707–10. doi: 10.1007/BF01709751. - DOI - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources