Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 8;14(1):5725.
doi: 10.1038/s41598-024-55577-6.

A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals

Affiliations

A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals

Moein E Samadi et al. Sci Rep. .

Abstract

The development of reliable mortality risk stratification models is an active research area in computational healthcare. Mortality risk stratification provides a standard to assist physicians in evaluating a patient's condition or prognosis objectively. Particular interest lies in methods that are transparent to clinical interpretation and that retain predictive power once validated across diverse datasets they were not trained on. This study addresses the challenge of consolidating numerous ICD codes for predictive modeling of ICU mortality, employing a hybrid modeling approach that integrates mechanistic, clinical knowledge with mathematical and machine learning models . A tree-structured network connecting independent modules that carry clinical meaning is implemented for interpretability. Our training strategy utilizes graph-theoretic methods for data analysis, aiming to identify the functions of individual black-box modules within the tree-structured network by harnessing solutions from specific max-cut problems. The trained model is then validated on external datasets from different hospitals, demonstrating successful generalization capabilities, particularly in binary-feature datasets where label assessment involves extrapolation.

Keywords: Generalizability; Hybrid modeling; ICD codes; ICU mortality prediction; Interpretability; Machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Average and the 95% confidence interval of the Jaccard similarity measures between data samples from a validation hospital and the Derivation Hospital, emphasizing the degree of relatedness between the Derivation Hospital and four validation hospitals.
Figure 2
Figure 2
Proposed structured hybrid model for mortality risk stratification of critically ill, influenza and pneumonia patients in the ICU. The model consists of five modules: kidney failure, infectious and bacterial diseases, liver failure, mental and psychic, and lung failure; with their corresponding input features. The output module combines the precomputations of these modules to determine the overall mortality risk of a patient.
Figure 3
Figure 3
AUC–ROC curves comparing the discriminative ability of our hybrid model and the XGBoost model in distinguishing deceased and alive patients. The hybrid model outperformed XGBoost for Validation hospitals 1, 3, and 4, where their similarities with the Derivation Hospital are less pronounced, highlighting the hybrid model’s generalizability.
Figure 4
Figure 4
SHAP values distribution for 12 ICD codes in the XGBoost model, used to interpret ICU mortality causes. The figure showcases inconsistency in the feature importance across the five hospitals involved in the study.
Figure 5
Figure 5
SHAP value distribution for the hybrid model’s black-box modules across five hospitals. The consistency across hospitals showcases the hybrid model’s interpretability, reliability, and stability in mortality prediction across diverse healthcare settings.
Figure 6
Figure 6
Simple case: a tree-structured network with three first-layer modules mapping 7-dimensional binary input variable to binary outputs.
Figure 7
Figure 7
The schematic representation of T0 for the simple case, which contains 27 elements holding the number of 0 labels for each input configuration in given training data.
Algorithm 1
Algorithm 1
Risk stratification algorithm.
Figure 8
Figure 8
(a) All 27 possible binary inputs of Fsimple. Each row runs along 8 input configuration V1={1,2,,8} of Module-1 and depicts the inputs variables of Fsimple with fixed inputs to Module-2 and Module-3. The blue cells in the same row depict all 16 possible pairs of input variables for which the decimal representation of the inputs to the 3 first-layer modules are like (1, jk) and (4, jk). (b) To determine the weights of the conflict graph G1(V1,E1) of Module-1, we compare the labels of input variables within the same row. (c) The conflict graph G1(V1,E1) of Module-1 with both binary and decimal representations of vertices. In the risk stratification algorithm, the value of edge w14 results from Eq. (4) iterated over all jV2={1,2,3,4} and kV3={1,2,3,4}.

Similar articles

References

    1. Sekulic AD, Trpkovic SV, Pavlovic AP, Marinkovic OM, Ilic AN. Scoring systems in assessing survival of critically ill ICU patients. Med. Sci. Monit. Int. Med. J. Exp. Clin. Res. 2015;21:2621. - PMC - PubMed
    1. Kafan S, et al. Predicting risk score for mechanical ventilation in hospitalized adult patients suffering from covid-19. Anesthesiol. Pain Med. 2021;11:25. doi: 10.5812/aapm.112424. - DOI - PMC - PubMed
    1. Verburg IWM, et al. Which models can i use to predict adult ICU length of stay? A systematic review. Crit. Care Med. 2017;45:e222–e231. doi: 10.1097/CCM.0000000000002054. - DOI - PubMed
    1. Rapsang AG, Shyam DC. Scoring systems in the intensive care unit: A compendium. Indian J. Crit. Care Med. Peer Rev. 2014;18:220. doi: 10.4103/0972-5229.130573. - DOI - PMC - PubMed
    1. Knaus WA, Zimmerman JE, Wagner DP, Draper EA, Lawrence DE. Apache-acute physiology and chronic health evaluation: A physiologically based classification system. Crit. Care Med. 1981;9:591–597. doi: 10.1097/00003246-198108000-00008. - DOI - PubMed