Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Apr 8;15(1):12057.
doi: 10.1038/s41598-025-93961-y.

Deciphering the impact of sepsis phenotypes on improving clinical outcome predictions: a multicenter retrospective analysis based on critical care in China

Affiliations
Multicenter Study

Deciphering the impact of sepsis phenotypes on improving clinical outcome predictions: a multicenter retrospective analysis based on critical care in China

Luyao Zhou et al. Sci Rep. .

Abstract

Sepsis is a clinically heterogeneous disease with high mortality. It is crucial to develop relevant therapeutic strategies for different sepsis phenotypes, but the impact of phenotypes on patients' clinical outcomes is unclear. This study aimed to identify potential sepsis phenotypes using readily available clinical parameters and assess their predictive value for 28-day clinical outcomes by logistic regression analysis. In this retrospective analysis, researchers extracted clinical data from adult patients admitted to the First Affiliated Hospital of Anhui Medical University between April and August 2022 and from the 2014-2015 eICU Collaborative Study database. K-Means clustering was utilized to identify and refine sepsis phenotypes, and their predictive performance was subsequently evaluated. Logistic regression models were trained independently for each phenotype and five-fold cross-validation was used to predict clinical outcomes. Predictive accuracy was then compared to traditional non-clustered prediction methods using model assessment scores. The study cohort consisted of 250 patients from the First Affiliated Hospital of Anhui Medical University, allocated in a 7:3 ratio for training and testing, respectively, and an external validation cohort of 3100 patients from the eICU Cooperative Research Database. The results of the phenotype-based prediction model demonstrated an improvement in F1 score from 0.74 to 0.82 and AUC from 0.74(95%CI 0.71-0.80) to 0.84(95%CI 0.82-0.87), and these results also highlight the superiority of clinical outcome prediction with the help of sepsis phenotypes over traditional prediction methods. Phenotype-based prediction of 28-day clinical outcomes in sepsis demonstrated significant advantages over traditional models, highlighting the impact of phenotype-driven modeling on clinical outcomes in sepsis.

Keywords: Clustering algorithms; Intensive care; Machine learning; Phenotypes; Sepsis.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests. Informed consent: The study protocol was approved by the research ethics board of our institution, and all patients provided informed consent before their medical records was reviewed. The performance of this study conformed with the Declaration of Helsinki.

Figures

Fig. 1
Fig. 1
Study-specific flowchart. The figure illustrates the entire process of training the model using internal and external validation datasets after inclusion in the analysis according to the exclusion criteria.
Fig. 2
Fig. 2
Principal component analysis visualization clustering results. Blue : phenotype A; Green : phenotype B; Yellow : phenotype C.
Fig. 3
Fig. 3
Differences and comparisons among the three phenotypes obtained after clustering in patients with sepsis.
Fig. 4
Fig. 4
Normalized confusion matrix was constructed to compare K-Means and GMM clustering results. Correlation analysis was conducted between the outcomes of K-Means clustering classes A, B, and C, and those of GMM clustering classes A, B, and C.
Fig. 5
Fig. 5
The ROC curves for K-Means clustering (panels A and B) and GMM clustering (panels C and D). The curves compare the prediction of patient clinical outcomes using models constructed from both local and eICU datasets, with results shown for both typed (clustered) and untyped (non-clustered) data.
Fig. 6
Fig. 6
Line plot depicting model F1 and AUC based on varying numbers of features, illustrating specific trends observed from 9 to 15 features.

Similar articles

References

    1. Verdonk, F., Blet, A. & Mebazaa, A. The new sepsis definition. Curr. Opin. Anaesthesiol.30 (2), 200–204. 10.1097/aco.0000000000000446 (2017). - PubMed
    1. Shankar-Hari, M. et al. Developing a new definition and assessing new clinical criteria for septic shock. Jama315 (8). 10.1001/jama.2016.0289 (2016). - PMC - PubMed
    1. Singer, M. et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA315 (8), 801–810. 10.1001/jama.2016.0287 (2016). - PMC - PubMed
    1. Peach, B. C. Implications of the new sepsis definition on research and practice. J. Crit. Care38, 259–262. 10.1016/j.jcrc.2016.11.032 (2017). - PubMed
    1. Luyao Zhou, M., Shao, C., Wang, Y. & Wang An early sepsis prediction model utilizing machine learning and unbalanced data processing in a clinical context. Reventive Med. Rep.102841, 2211–3355. 10.1016/j.pmedr.2024.102841 (2024). - PMC - PubMed

Publication types