Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

doi:10.1186/s40001-024-01756-0

Multicenter Study

Eur J Med Res. 2024 Mar 6;29(1):156.

Guyu Zhang¹, Fei Shao¹, Wei Yuan¹, Junyuan Wu¹, Xuan Qi¹, Jie Gao¹, Rui Shao¹, Ziren Tang^#², Tao Wang^#³

Affiliations

¹ Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China.
² Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China. TangZiren1970@126.com.
³ Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China. wangtao19780117@sina.com.

^# Contributed equally.

PMID: 38448999
PMCID: PMC10918942
DOI: 10.1186/s40001-024-01756-0

Guyu Zhang et al. Eur J Med Res. 2024.

Abstract

Background: This study aimed to develop and validate an interpretable machine-learning model that utilizes clinical features and inflammatory biomarkers to predict the risk of in-hospital mortality in critically ill patients suffering from sepsis.

Methods: We enrolled all patients diagnosed with sepsis in the Medical Information Mart for Intensive Care IV (MIMIC-IV, v.2.0), the eICU Collaborative Research Database (eICU-CRD 2.0), and the Amsterdam University Medical Centers database (AmsterdamUMCdb 1.0.2). LASSO regression was employed for feature selection. Seven machine-learning methods were applied to develop prognostic models. The optimal model was chosen based on its accuracy, F1 score, and area under the curve (AUC) in the validation cohort. We used the SHapley Additive exPlanations (SHAP) method to quantify each feature's contribution to the model and to analyze how individual features affect the model's output. Finally, Spearman correlation analysis examined the associations among continuous predictor variables, and restricted cubic splines (RCS) explored potential non-linear relationships between continuous risk factors and in-hospital mortality.
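The three selection criteria named in the Methods (accuracy, F1 score, and AUC) can be illustrated with a minimal, dependency-free sketch. The labels, scores, and the 0.5 decision threshold below are hypothetical values chosen for illustration; they are not data or settings from the study.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class (1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive case is scored
    higher than a random negative case (ties count as 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical validation labels (1 = in-hospital death) and model scores.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.7, 0.4, 0.3, 0.6, 0.8, 0.1]
# Assumed decision threshold of 0.5; the study does not state one here.
y_pred = [int(s >= 0.5) for s in scores]

print(accuracy(y_true, y_pred), f1_score(y_true, y_pred), roc_auc(y_true, scores))
```

Accuracy and F1 depend on the chosen threshold, whereas AUC is computed from the ranking of the scores alone, which is one reason the three metrics are usually reported together.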

Results: A total of 3535 patients with sepsis were eligible for this study. The median age was 66 years (IQR, 55-77 years), and 56% were male. After selection, 12 of the 45 clinical parameters collected on the first day after ICU admission remained associated with prognosis and were used to develop machine-learning models. Among the seven constructed models, the eXtreme Gradient Boosting (XGBoost) model achieved the best performance, with an AUC of 0.94 and an F1 score of 0.937 in the validation cohort. Feature importance analysis revealed that age, AST, invasive ventilation treatment, and serum urea nitrogen (BUN) were the four most influential features of the XGBoost model. Inflammatory biomarkers may have prognostic value. Furthermore, SHAP force analysis visualized how the constructed model arrived at its prediction for an individual patient.

Conclusions: This study demonstrated the potential of machine-learning approaches for early prediction of outcomes in patients with sepsis. The SHAP method could improve the interpretability of machine-learning models and help clinicians better understand the reasoning behind the predicted outcome.

Keywords: Intensive care unit; Machine learning; Prediction; Sepsis; XGBoost.

Conflict of interest statement

The authors declare that the research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Fig. 1**
A flowchart illustrating patient enrollment and the analysis workflow. Following the exclusion of 83,829 patients, 3535 patients were included from three databases. MIMIC-IV: Medical Information Mart for Intensive Care IV database; eICU-CRD: eICU Collaborative Research Database; AMDS: Amsterdam University Medical Centers database; ROC: receiver operating characteristic curve; DCA: decision curve analysis

**Fig. 2**
ROC curve comparison of six models and the SOFA score in the training cohort and validation cohort. DT: Decision Tree; XGBoost: eXtreme Gradient Boosting; KNN: k-Nearest Neighbors; RF: Random Forest; NB: Naive Bayes; LR: Logistic Regression; SVM: Support Vector Machine. A ROC curve in the validation cohort. B ROC curve in the test cohort

**Fig. 3**
DCA curve comparison of six models and the SOFA score in the training cohort and validation cohort. DCA: decision curve analysis; DT: Decision Tree; XGBoost: eXtreme Gradient Boosting; KNN: k-Nearest Neighbors; RF: Random Forest; NB: Naive Bayes; LR: Logistic Regression; SVM: Support Vector Machine. A DCA curve of XGBoost and the SOFA score in the validation cohort. B DCA curve of the other six models in the validation cohort. C DCA curve of XGBoost and the SOFA score in the validation cohort. D DCA curve of the other six models in the test cohort

**Fig. 4**
A Scatter plot of feature values and SHAP values. The purple part of the feature value represents a lower value. B Consent waterfall plot showing an example of interpretability analysis for a patient. The yellow part of the feature value represents a positive effect on the model. The deep red part of the feature value represents a negative effect on the model

**Fig. 5**
The feature importance of SHAP method and conventional method for XGBoost model. A Feature importance of conventional method for the XGBoost model. B Feature importance of SHAP method for the XGBoost model. BUN: Urea nitrogen
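SHAP values are Shapley values: each feature's average marginal contribution to a prediction over all feature subsets. The sketch below computes them exactly by brute-force enumeration for a hypothetical three-feature risk function. It illustrates the definition only; it is not the tree-based estimator normally applied to XGBoost, and `toy_risk`, its coefficients, and the patient and baseline values are invented for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance by enumerating all subsets.

    Features outside a subset are replaced by their baseline value.
    """
    n = len(x)

    def value(subset):
        mixed = [x[i] if i in subset else baseline[i] for i in range(n)]
        return predict(mixed)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for k in range(n):
            # Shapley weight for subsets of size k that exclude feature i.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


def toy_risk(features):
    """Hypothetical risk score over (age, AST, ventilated); not the paper's model."""
    age, ast, vent = features
    return 0.01 * age + 0.002 * ast + 0.3 * vent + 0.001 * age * vent


patient = [80, 100, 1]   # hypothetical patient
baseline = [60, 40, 0]   # hypothetical reference patient
phi = shapley_values(toy_risk, patient, baseline)
print(dict(zip(["age", "AST", "ventilated"], phi)))
```

The contributions sum to the difference between the patient's score and the baseline score, which is the property a waterfall plot relies on. Averaging the absolute contributions over many patients gives a SHAP-style feature importance ranking.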

**Fig. 6**
The association between variables and hospital mortality. Albumin (A), Potassium (B), NHR (C), Heart rate (D), BUN (E), NLR (F): restricted cubic splines with four knots. The horizontal dashed line represents the reference OR of 1.0. The model was multivariate-adjusted for Age, AST, invasive ventilation treatment (yes/no), renal replacement treatment (yes/no), Albumin, cerebrovascular disease (yes/no), MHR, NLR, NHR, and Potassium. OR: odds ratio; 95% CI: 95% confidence interval

**Fig. 7**
Spearman correlation analysis between variables. The color spectrum, ranging from blue to yellow, represents the degree of correlation: closer to blue indicates a stronger positive correlation, while closer to yellow indicates a stronger negative correlation
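Spearman correlation is the Pearson correlation of the rank-transformed variables, with tied values given the mean of their ranks. A minimal sketch follows; the BUN and potassium values are hypothetical and serve only to show the calculation.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values for two continuous predictors.
bun = [12, 25, 18, 40, 33]
potassium = [3.9, 4.4, 4.1, 5.2, 4.8]
rho = spearman(bun, potassium)
print(rho)
```

Because only ranks enter the calculation, the coefficient is unaffected by monotonic transformations of either variable and is less sensitive to extreme laboratory values than Pearson correlation.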

Cited by

Navigating the Modern Landscape of Sepsis: Advances in Diagnosis and Treatment.
Jang JH, Choi E, Kim T, Yeo HJ, Jeon D, Kim YS, Cho WH. Int J Mol Sci. 2024 Jul 5;25(13):7396. doi: 10.3390/ijms25137396. PMID: 39000503. Free PMC article. Review.
Thirty-day mortality risk prediction for geriatric patients undergoing non-cardiac surgery in the surgical intensive care unit.
Ma M, Liu J, Li C, Chen Y, Jia H, Hou A, Xu H. Eur J Med Res. 2025 May 9;30(1):372. doi: 10.1186/s40001-025-02543-1. PMID: 40346684. Free PMC article.
Artificial Intelligence in Sepsis Management: An Overview for Clinicians.
Bignami EG, Berdini M, Panizzi M, Domenichetti T, Bezzi F, Allai S, Damiano T, Bellini V. J Clin Med. 2025 Jan 6;14(1):286. doi: 10.3390/jcm14010286. PMID: 39797368. Free PMC article. Review.
Machine Learning and Artificial Intelligence for Infectious Disease Surveillance, Diagnosis, and Prognosis.
Cheah BCJ, Vicente CR, Chan KR. Viruses. 2025 Jun 23;17(7):882. doi: 10.3390/v17070882. PMID: 40733500. Free PMC article. Review.
Machine learning based prediction of cognitive metrics using major biomarkers in SuperAgers.
Lee HB, Kwon SY, Park JH, Kim B, Kim GH, Choi JH, Park YM. Sci Rep. 2025 May 28;15(1):18735. doi: 10.1038/s41598-025-01477-2. PMID: 40436987. Free PMC article.
