Interpretable machine learning model for early prediction of delirium in elderly patients following intensive care unit admission: a derivation and validation study

doi:10.3389/fmed.2024.1399848

. 2024 May 17:11:1399848.

doi: 10.3389/fmed.2024.1399848. eCollection 2024.

Interpretable machine learning model for early prediction of delirium in elderly patients following intensive care unit admission: a derivation and validation study

Dayu Tang¹, Chengyong Ma¹, Yu Xu¹

Affiliations

PMID: 38828233
PMCID: PMC11140063
DOI: 10.3389/fmed.2024.1399848

Interpretable machine learning model for early prediction of delirium in elderly patients following intensive care unit admission: a derivation and validation study

Dayu Tang et al. Front Med (Lausanne). 2024.

. 2024 May 17:11:1399848.

doi: 10.3389/fmed.2024.1399848. eCollection 2024.

Authors

Dayu Tang¹, Chengyong Ma¹, Yu Xu¹

Affiliation

¹ Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu, China.

PMID: 38828233
PMCID: PMC11140063
DOI: 10.3389/fmed.2024.1399848

Abstract

Background and objective: Delirium is the most common neuropsychological complication among older adults admitted to the intensive care unit (ICU) and is often associated with a poor prognosis. This study aimed to construct and validate an interpretable machine learning (ML) for early delirium prediction in older ICU patients.

Methods: This was a retrospective observational cohort study and patient data were extracted from the Medical Information Mart for Intensive Care-IV database. Feature variables associated with delirium, including predisposing factors, disease-related factors, and iatrogenic and environmental factors, were selected using least absolute shrinkage and selection operator regression, and prediction models were built using logistic regression, decision trees, support vector machines, extreme gradient boosting (XGBoost), k-nearest neighbors and naive Bayes methods. Multiple metrics were used for evaluation of performance of the models, including the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, recall, F1 score, calibration plot, and decision curve analysis. SHapley Additive exPlanations (SHAP) were used to improve the interpretability of the final model.

Results: Nine thousand seven hundred forty-eight adults aged 65 years or older were included for analysis. Twenty-six features were selected to construct ML prediction models. Among the models compared, the XGBoost model demonstrated the best performance including the highest AUC (0.836), accuracy (0.765), sensitivity (0.713), recall (0.713), and F1 score (0.725) in the training set. It also exhibited excellent discrimination with AUC of 0.810, good calibration, and had the highest net benefit in the validation cohort. The SHAP summary analysis showed that Glasgow Coma Scale, mechanical ventilation, and sedation were the top three risk features for outcome prediction. The SHAP dependency plot and SHAP force analysis interpreted the model at both the factor level and individual level, respectively.

Conclusion: ML is a reliable tool for predicting the risk of critical delirium in elderly patients. By combining XGBoost and SHAP, it can provide clear explanations for personalized risk prediction and more intuitive understanding of the effect of key features in the model. The establishment of such a model would facilitate the early risk assessment and prompt intervention for delirium.

Keywords: ICU; delirium; elderly; explainable machine learning; prediction model.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
The flowchart and framework of the prediction models.

**Figure 2**
Feature selection by the LASSO regression model. **(A)** The LASSO model underwent tenfold cross-validation to determine the optimal penalization coefficient parameter (lambda). **(B)** The plots depict the LASSO regression coefficients across various penalty parameter values. The lambda. 1se was chosen in our study due to its stricter penalty and ability to reduce overfitting. LASSO, least absolute shrinkage and selection operator.

**Figure 3**
Comprehensive evaluation of machine learning models. **(A)** ROC curves and AUC values of the training set. **(B)** ROC curves and AUC values of the validation set. **(C)** Calibration curves of the XGBoost, DT, KNN models in the validation set. **(D)** Decision curves analysis of the XGBoost, RF, SVM models in the validation set. ROC, receiver operating characteristic; AUC, the area under the receiver operating characteristic curve; LR, logistic regression; XGBoost, extreme gradient boosting; DT, decision tree; SVM, support vector machine; KNN, k-nearest neighbors; NB, naive bayes.

**Figure 4**
Feature importance analysis by SHAP method for XGBoost model. **(A)** SHAP significance analysis of feature importance ranking based on the mean value. **(B)** SHAP summary plot of the XGBoost model. GCS, Glasgow Coma Scale; MV, mechanical ventilation; APSIII, the Acute Physiology Score III; T, temperature; DBP, diastolic blood pressure; SpO₂, oxyhemoglobin saturation; SOFA, the Sequential Organ Failure Assessment Score; MBP, mean blood pressure; R, respiratory rate; SBP, systolic blood pressure; Cl, chloride; BUN, blood urea nitrogen; HR, heart rate; SAPSII, the Simplified Acute Physiology Score II; AF, Atrial fibrillation; Admtype, type of admission; COPD, chronic obstructive pulmonary disease; AKI, acute kidney injury.

**Figure 5**
SHAP dependency plot of features in the XGBoost model. The Y-axis represents SHAP values, while the X-axis represents actual clinical parameters. For binary variables such as MV and sedation, “0” indicates the absence of the condition, while “1” indicates its presence. Significantly, when a feature’s SHAP value is greater than 0, it suggests an increased risk of delirium, whereas a negative SHAP value suggests a reduced risk. GCS, Glasgow Coma Scale; MV, mechanical ventilation; APSIII, the Acute Physiology Score III.

See this image and copyright information in PMC

Cited by

Enhanced machine learning predictive modeling for delirium in elderly ICU patients with COPD and respiratory failure: A retrospective study based on MIMIC-IV.
Wu ZB, Jiang YL, Li SS, Li A. Wu ZB, et al. PLoS One. 2025 Mar 20;20(3):e0319297. doi: 10.1371/journal.pone.0319297. eCollection 2025. PLoS One. 2025. PMID: 40112262 Free PMC article.
Machine learning prediction model with shap interpretation for chronic bronchitis risk assessment based on heavy metal exposure: a nationally representative study.
Xia T, Han K. Xia T, et al. BMC Pulm Med. 2025 May 22;25(1):252. doi: 10.1186/s12890-025-03724-8. BMC Pulm Med. 2025. PMID: 40405146 Free PMC article.
Predicting ICU Delirium in Critically Ill COVID-19 Patients Using Demographic, Clinical, and Laboratory Admission Data: A Machine Learning Approach.
Viegas A, Von Rekowski CP, Araújo R, Viana-Baptista M, Macedo MP, Bento L. Viegas A, et al. Life (Basel). 2025 Jun 30;15(7):1045. doi: 10.3390/life15071045. Life (Basel). 2025. PMID: 40724546 Free PMC article.
Development of a risk prediction model for sepsis-related delirium based on multiple machine learning approaches and an online calculator.
Gao L, Wang GD, Yang XY, Tong SJ, Wang XJ, Chen YR, Bai JY, Zhang YX. Gao L, et al. PLoS One. 2025 Jul 16;20(7):e0323831. doi: 10.1371/journal.pone.0323831. eCollection 2025. PLoS One. 2025. PMID: 40668815 Free PMC article.
Development and internal verification of nomogram for forecasting delirium in the elderly admitted to intensive care units: an analysis of MIMIC-IV database.
Jiang L, Yu D, Yang G, Wu X, Zhang D. Jiang L, et al. Front Neurol. 2025 May 13;16:1580125. doi: 10.3389/fneur.2025.1580125. eCollection 2025. Front Neurol. 2025. PMID: 40433619 Free PMC article.

See all "Cited by" articles

References

1. Mattison MLP. Delirium. Ann Intern Med. (2020) 173:ITC49–64. doi: 10.7326/AITC202010060 - DOI - PubMed
1. Thom RP, Levy-Carrick NC, Bui M, Silbersweig D. Delirium. Am J Psychiatry. (2019) 176:785–93. doi: 10.1176/appi.ajp.2018.18070893 - DOI - PubMed
1. Stollings JL, Kotfis K, Chanques G, Pun BT, Pandharipande PP, Ely EW. Delirium in critical illness: clinical manifestations, outcomes, and management. Intensive Care Med. (2021) 47:1089–103. doi: 10.1007/s00134-021-06503-1, PMID: - DOI - PMC - PubMed
1. Fong TG, Tulebaev SR, Inouye SK. Delirium in elderly adults: diagnosis, prevention and treatment. Nat Rev Neurol. (2009) 5:210–20. doi: 10.1038/nrneurol.2009.24, PMID: - DOI - PMC - PubMed
1. Inouye SK, Westendorp RG, Saczynski JS. Delirium in elderly people. Lancet. (2014) 383:911–22. doi: 10.1016/S0140-6736(13)60688-1, PMID: - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources

[1] Mattison MLP. Delirium. Ann Intern Med. (2020) 173:ITC49–64. doi: 10.7326/AITC202010060 - DOI - PubMed

[2] Mattison MLP. Delirium. Ann Intern Med. (2020) 173:ITC49–64. doi: 10.7326/AITC202010060 - DOI - PubMed

[3] Thom RP, Levy-Carrick NC, Bui M, Silbersweig D. Delirium. Am J Psychiatry. (2019) 176:785–93. doi: 10.1176/appi.ajp.2018.18070893 - DOI - PubMed

[4] Thom RP, Levy-Carrick NC, Bui M, Silbersweig D. Delirium. Am J Psychiatry. (2019) 176:785–93. doi: 10.1176/appi.ajp.2018.18070893 - DOI - PubMed

[5] Stollings JL, Kotfis K, Chanques G, Pun BT, Pandharipande PP, Ely EW. Delirium in critical illness: clinical manifestations, outcomes, and management. Intensive Care Med. (2021) 47:1089–103. doi: 10.1007/s00134-021-06503-1, PMID: - DOI - PMC - PubMed

[6] Stollings JL, Kotfis K, Chanques G, Pun BT, Pandharipande PP, Ely EW. Delirium in critical illness: clinical manifestations, outcomes, and management. Intensive Care Med. (2021) 47:1089–103. doi: 10.1007/s00134-021-06503-1, PMID: - DOI - PMC - PubMed

[7] Fong TG, Tulebaev SR, Inouye SK. Delirium in elderly adults: diagnosis, prevention and treatment. Nat Rev Neurol. (2009) 5:210–20. doi: 10.1038/nrneurol.2009.24, PMID: - DOI - PMC - PubMed

[8] Fong TG, Tulebaev SR, Inouye SK. Delirium in elderly adults: diagnosis, prevention and treatment. Nat Rev Neurol. (2009) 5:210–20. doi: 10.1038/nrneurol.2009.24, PMID: - DOI - PMC - PubMed

[9] Inouye SK, Westendorp RG, Saczynski JS. Delirium in elderly people. Lancet. (2014) 383:911–22. doi: 10.1016/S0140-6736(13)60688-1, PMID: - DOI - PMC - PubMed

[10] Inouye SK, Westendorp RG, Saczynski JS. Delirium in elderly people. Lancet. (2014) 383:911–22. doi: 10.1016/S0140-6736(13)60688-1, PMID: - DOI - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Interpretable machine learning model for early prediction of delirium in elderly patients following intensive care unit admission: a derivation and validation study

Affiliation

Interpretable machine learning model for early prediction of delirium in elderly patients following intensive care unit admission: a derivation and validation study

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Related information

LinkOut - more resources

Full Text Sources