Sci Rep. 2024 Mar 8;14(1):5725.

doi: 10.1038/s41598-024-55577-6.

A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals

Moein E Samadi¹, Jorge Guzman-Maldonado², Kateryna Nikulina², Hedieh Mirzaieazar², Konstantin Sharafutdinov², Sebastian Johannes Fritsch³,⁴,⁵, Andreas Schuppert²

Affiliations

¹ Institute for Computational Biomedicine, RWTH Aachen University, Aachen, Germany. moein.samadi@rwth-aachen.de.
² Institute for Computational Biomedicine, RWTH Aachen University, Aachen, Germany.
³ Department of Intensive Care Medicine, University Hospital RWTH Aachen, Aachen, Germany.
⁴ Jülich Supercomputing Centre, Forschungszentrum Jülich, Jülich, Germany.
⁵ Center for Advanced Simulation and Analytics (CASA), Forschungszentrum Jülich, Jülich, Germany.

PMID: 38459085
PMCID: PMC10923850
DOI: 10.1038/s41598-024-55577-6

Moein E Samadi et al. Sci Rep. 2024.

Abstract

The development of reliable mortality risk stratification models is an active research area in computational healthcare. Mortality risk stratification provides a standard to assist physicians in evaluating a patient's condition or prognosis objectively. Particular interest lies in methods that are transparent to clinical interpretation and that retain predictive power once validated across diverse datasets they were not trained on. This study addresses the challenge of consolidating numerous ICD codes for predictive modeling of ICU mortality, employing a hybrid modeling approach that integrates mechanistic, clinical knowledge with mathematical and machine learning models. A tree-structured network connecting independent modules that carry clinical meaning is implemented for interpretability. Our training strategy utilizes graph-theoretic methods for data analysis, aiming to identify the functions of individual black-box modules within the tree-structured network by harnessing solutions from specific max-cut problems. The trained model is then validated on external datasets from different hospitals, demonstrating successful generalization capabilities, particularly in binary-feature datasets where label assessment involves extrapolation.

Keywords: Generalizability; Hybrid modeling; ICD codes; ICU mortality prediction; Interpretability; Machine learning.

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Average and the 95% confidence interval of the Jaccard similarity measures between data samples from a validation hospital and the Derivation Hospital, emphasizing the degree of relatedness between the Derivation Hospital and four validation hospitals.
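
As a rough illustration of the quantity plotted in Figure 1, the sketch below computes the Jaccard similarity between binary feature vectors and summarizes it with a mean and a 95% confidence interval. The caption does not say how samples are paired or how the interval is obtained, so the all-pairs comparison, the normal-approximation interval, and every function name here are assumptions made only for this example.

```python
import math
from itertools import product


def jaccard(a, b):
    """Jaccard similarity of two binary feature vectors (1 = code present)."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Assumption: two records with no active features count as identical.
    return both / either if either else 1.0


def mean_with_ci(values, z=1.96):
    """Mean and normal-approximation 95% confidence interval (assumed method)."""
    n = len(values)
    mean = sum(values) / n
    if n < 2:
        return mean, (mean, mean)
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    half = z * math.sqrt(var / n)
    return mean, (mean - half, mean + half)


def hospital_similarity(validation, derivation):
    """Compare every validation sample with every derivation sample (assumed pairing)."""
    sims = [jaccard(v, d) for v, d in product(validation, derivation)]
    return mean_with_ci(sims)


# Made-up toy records with four binary features each.
derivation = [[1, 0, 1, 0], [1, 1, 0, 0]]
validation = [[1, 0, 1, 0], [0, 1, 0, 1]]
mean, (low, high) = hospital_similarity(validation, derivation)
print(f"mean Jaccard = {mean:.3f}, 95% CI = ({low:.3f}, {high:.3f})")
```

A lower mean would indicate a validation hospital whose patient records overlap less with those of the Derivation Hospital.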

**Figure 2**
Proposed structured hybrid model for mortality risk stratification of critically ill influenza and pneumonia patients in the ICU. The model consists of five modules, each with its corresponding input features: kidney failure, infectious and bacterial diseases, liver failure, mental and psychic, and lung failure. The output module combines the precomputations of these modules to determine the overall mortality risk of a patient.

**Figure 3**
AUC–ROC curves comparing the discriminative ability of our hybrid model and the XGBoost model in distinguishing deceased and alive patients. The hybrid model outperformed XGBoost for Validation hospitals 1, 3, and 4, whose similarity to the Derivation Hospital is less pronounced, highlighting the hybrid model’s generalizability.
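
For readers unfamiliar with the metric in Figure 3, the area under the ROC curve equals the probability that a randomly chosen positive case (here, a deceased patient) receives a higher risk score than a randomly chosen negative case, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The labels and scores are invented for illustration and are not taken from the study.

```python
def auc_roc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    # A correctly ordered pair scores 1, a tie scores 0.5, a reversed pair scores 0.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up labels (1 = deceased) and predicted risk scores.
y_true = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.3, 0.4, 0.35, 0.1]
auc = auc_roc(y_true, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise form is quadratic in the number of samples; it is used here only because it makes the definition explicit.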

**Figure 4**
SHAP value distribution for 12 ICD codes in the XGBoost model, used to interpret ICU mortality causes. The figure showcases inconsistency in the feature importance across the five hospitals involved in the study.

**Figure 5**
SHAP value distribution for the hybrid model’s black-box modules across five hospitals. The consistency across hospitals showcases the hybrid model’s interpretability, reliability, and stability in mortality prediction across diverse healthcare settings.

**Figure 6**
Simple case: a tree-structured network with three first-layer modules mapping a 7-dimensional binary input variable to binary outputs.
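
The simple case in Figure 6 can be pictured as a set of small lookup tables: three first-layer modules each turn their share of the 7 input bits into one bit, and an output module maps the three resulting bits to the final label. The sketch below assumes a 3/2/2 split of the input bits, which matches the 8, 4, and 4 input configurations mentioned in the Figure 8 caption. The table contents and all names are hypothetical placeholders, not the functions learned in the paper.

```python
# Assumed split of the 7 input bits among Module-1, Module-2, and Module-3.
BIT_GROUPS = (slice(0, 3), slice(3, 5), slice(5, 7))


def to_index(bits):
    """Read a sequence of 0/1 values as a binary number (most significant bit first)."""
    idx = 0
    for b in bits:
        idx = (idx << 1) | int(b)
    return idx


class TreeModel:
    """Tree-structured network whose modules are plain binary lookup tables."""

    def __init__(self, module_tables, output_table):
        self.module_tables = module_tables  # one table per first-layer module
        self.output_table = output_table    # 2**3 entries for the output module

    def predict(self, x):
        if len(x) != 7:
            raise ValueError("expected a 7-dimensional binary input")
        hidden = [
            table[to_index(x[group])]
            for table, group in zip(self.module_tables, BIT_GROUPS)
        ]
        return self.output_table[to_index(hidden)]


# Hypothetical module functions chosen only to make the example concrete.
model = TreeModel(
    module_tables=[
        [0, 0, 0, 1, 0, 1, 1, 1],  # Module-1: majority of its 3 bits
        [0, 1, 1, 1],              # Module-2: OR of its 2 bits
        [0, 0, 0, 1],              # Module-3: AND of its 2 bits
    ],
    output_table=[0, 0, 0, 1, 0, 1, 1, 1],  # output: majority of the 3 module bits
)
print(model.predict([1, 1, 0, 0, 1, 1, 1]))
```

Because every module is a small table, each intermediate bit can be inspected directly, which is the interpretability property the tree structure is meant to provide.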

**Figure 7**
The schematic representation of $T_{0}$ for the simple case, which contains $2^{7}$ elements holding the number of 0 labels for each input configuration in the given training data.
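
A minimal sketch of how a table like $T_{0}$ could be built: one counter per input configuration, incremented whenever a training sample with that configuration carries label 0. The flat list indexed by the binary encoding of the input is an assumed layout, and the function names and toy data are hypothetical.

```python
def config_index(bits):
    """Encode a binary input vector as an integer (most significant bit first)."""
    idx = 0
    for b in bits:
        idx = (idx << 1) | int(b)
    return idx


def count_table(samples, labels, label=0, n_bits=7):
    """Count, for each of the 2**n_bits configurations, the samples with the given label."""
    table = [0] * (2 ** n_bits)
    for x, y in zip(samples, labels):
        if len(x) != n_bits:
            raise ValueError(f"expected {n_bits} binary features per sample")
        if y == label:
            table[config_index(x)] += 1
    return table


# Made-up training data: three label-0 samples and one label-1 sample.
samples = [
    (0, 0, 0, 0, 0, 0, 1),
    (0, 0, 0, 0, 0, 0, 1),
    (1, 1, 1, 1, 1, 1, 1),
    (0, 0, 0, 0, 0, 0, 1),
]
labels = [0, 0, 0, 1]
T0 = count_table(samples, labels, label=0)
print(len(T0), T0[1], T0[127])
```

Calling the same function with `label=1` would give the matching count of 1 labels, should a companion table be needed.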

**Algorithm 1**
Risk stratification algorithm.

**Figure 8**
(a) All $2^{7}$ possible binary inputs of $F_{simple}$. Each row runs along the 8 input configurations $V_{1} = \{1, 2, \dots, 8\}$ of Module-1 and depicts the input variables of $F_{simple}$ with fixed inputs to Module-2 and Module-3. The blue cells in the same row depict all 16 possible pairs of input variables for which the decimal representations of the inputs to the 3 first-layer modules are of the form $(1, j, k)$ and $(4, j, k)$. (b) To determine the weights of the conflict graph $G_{1} (V_{1}, E_{1})$ of Module-1, we compare the labels of input variables within the same row. (c) The conflict graph $G_{1} (V_{1}, E_{1})$ of Module-1 with both binary and decimal representations of vertices. In the risk stratification algorithm, the value of edge $w_{14}$ results from Eq. (4) iterated over all $j \in V_{2} = \{1, 2, 3, 4\}$ and $k \in V_{3} = \{1, 2, 3, 4\}$.
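
The procedure in Figure 8 can be sketched in two steps: build a weighted conflict graph over the 8 input configurations of Module-1, then split its vertices into two groups by solving a max-cut problem, so that configurations that conflict most often receive different module outputs. Eq. (4) is not reproduced on this page, so the edge weight below is a stand-in: it simply counts the $(j, k)$ contexts in which two Module-1 configurations carry different labels. The brute-force solver, the 0-based indexing, and all names are likewise assumptions for illustration.

```python
from itertools import combinations, product


def conflict_weights(labels, n1=8, n2=4, n3=4):
    """Stand-in edge weight (NOT Eq. (4)): number of (j, k) contexts where labels differ.

    `labels` maps (i, j, k) to an observed 0/1 label; unobserved inputs are skipped.
    """
    weights = {}
    for a, b in combinations(range(n1), 2):
        w = 0
        for j, k in product(range(n2), range(n3)):
            la, lb = labels.get((a, j, k)), labels.get((b, j, k))
            if la is not None and lb is not None and la != lb:
                w += 1
        weights[(a, b)] = w
    return weights


def max_cut(weights, n):
    """Exhaustive max-cut; feasible here because the graph has only 8 vertices."""
    best_value, best_side = -1, None
    # The last vertex stays on side 0, which removes mirror-image partitions.
    for mask in range(2 ** (n - 1)):
        side = [(mask >> v) & 1 for v in range(n)]
        value = sum(w for (a, b), w in weights.items() if side[a] != side[b])
        if value > best_value:
            best_value, best_side = value, side
    return best_value, best_side


# Made-up labels: the label is 1 only when i >= 4 and j == k.
labels = {
    (i, j, k): int(i >= 4 and j == k)
    for i, j, k in product(range(8), range(4), range(4))
}
weights = conflict_weights(labels)
value, side = max_cut(weights, 8)
print(value, side)
```

In this toy setup the cut separates configurations 0 to 3 from 4 to 7, recovering the hidden dependence on `i >= 4` up to a swap of the two sides. Max-cut is NP-hard in general, so exhaustive search is only practical for small modules like this one.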

Grants and funding

01ZZ1803B/Bundesministerium für Bildung und Forschung
