Machine Learning Multimodal Model for Delirium Risk Stratification
- PMID: 40332938
- PMCID: PMC12059973
- DOI: 10.1001/jamanetworkopen.2025.8874
Machine Learning Multimodal Model for Delirium Risk Stratification
Abstract
Importance: Automating the identification of risk for developing hospital delirium with models that use machine learning (ML) could facilitate more rapid prevention, identification, and treatment of delirium. However, there are very few reports on the performance of ML models for delirium risk stratification in live clinical practice.
Objective: To report on development, operationalization, and validation of a multimodal ML model for delirium risk stratification in live clinical practice and its associations with workflow and clinical outcomes.
Design, setting, and participants: This quality improvement study developed an ML model supported by automated electronic medical records to stratify the risk of non-intensive care unit delirium in live clinical practice using the Confusion Assessment Method as the diagnostic reference standard, with an iterative model update method. Data from patients aged at least 60 years admitted to non-intensive care units at Mount Sinai Hospital between January 2016 and January 2020 were used to train and test the ML model presented. The model was validated in live clinical practice from March 2023 to March 2024. Analysis of the model's associations with workflow and clinical outcomes was conducted retrospectively in 2024, comparing hospitalized patients prior to deployment of any model version (pre-ML cohort) and during model clinical deployment (post-ML cohort).
Main outcomes and measures: Outcomes of interest were area under the receiver operating characteristic curve, monthly delirium detection rates, median length of hospital stay, and daily doses of opiate, benzodiazepine, and antipsychotic medications administered.
Results: The overall sample included 32 284 inpatient admissions (mean [SD] age, 73.56 (9.67) years, 15 157 [46.9%] women). A total of 25 261 inpatient admissions of older patients with both medical and surgical primary diagnoses represented the combined model testing and training cohort (median age, 73.37 [66.42-81.36] years) and live clinical deployment validation cohort (median [IQR] age, 72.11 [62.26-78.97] years), while 7023 inpatient admissions of older patients with both medical and surgical primary diagnoses represented the combined pre-ML (median [IQR] age, 74.00 [68.00-81.00] years) and post-ML (median [IQR] age, 75.33 [68.34-82.91] years) cohorts. The model presented is a fusion of electronic medical record patient data features and clinical note features processed by natural language processing. The results of model validation in live clinical practice included an area under the curve of 0.94 (95% CI, 0.93-0.95). Median (IQR) monthly delirium detection rates of inpatients assessed for delirium with the Confusion Assessment Method increased from 4.42% (95% CI, 3.70%-5.14%) in the pre-ML cohort to 17.17% (95% CI, 15.54%-18.80%) in the post-ML cohort (P < .001). Post-ML vs pre-ML cohorts received lower daily doses of benzodiazepines (median [IQR] 0.93 [0.42-2.28] diazepam dose equivalents vs 1.60 [0.66-4.27] diazepam dose equivalents; P < .001) and olanzapine (median [IQR], 1.09 [0.38-2.46] mg vs 2.50 [1.17-6.65] mg; P < .001).
Conclusions and relevance: This quality improvement study demonstrates the feasibility of a novel multimodal ML model to automate delirium risk stratification in live clinical practice. The model demonstrated acceptable performance in live clinical practice and may facilitate resource allocation to enhance delirium identification and care.
Conflict of interest statement
Figures


Similar articles
-
Development and Validation of an Electronic Health Record-Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment.JAMA Netw Open. 2018 Aug 3;1(4):e181018. doi: 10.1001/jamanetworkopen.2018.1018. JAMA Netw Open. 2018. PMID: 30646095 Free PMC article.
-
A Multi-Phase Quality Improvement Initiative for the Treatment of Active Delirium in Older Persons.J Am Geriatr Soc. 2021 Jan;69(1):216-224. doi: 10.1111/jgs.16897. Epub 2020 Nov 4. J Am Geriatr Soc. 2021. PMID: 33150615
-
Development and Validation of a Machine Learning Model for Early Prediction of Delirium in Intensive Care Units Using Continuous Physiological Data: Retrospective Study.J Med Internet Res. 2025 Apr 2;27:e59520. doi: 10.2196/59520. J Med Internet Res. 2025. PMID: 40173433 Free PMC article.
-
Interventions for preventing intensive care unit delirium in adults.Cochrane Database Syst Rev. 2018 Nov 23;11(11):CD009783. doi: 10.1002/14651858.CD009783.pub2. Cochrane Database Syst Rev. 2018. PMID: 30484283 Free PMC article.
-
Navigating the machine learning pipeline: a scoping review of inpatient delirium prediction models.BMJ Health Care Inform. 2023 Jul;30(1):e100767. doi: 10.1136/bmjhci-2023-100767. BMJ Health Care Inform. 2023. PMID: 37407226 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Medical