Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 9;21(Suppl 5):95.
doi: 10.1186/s12911-021-01460-7.

Unsupervised phenotyping of sepsis using nonnegative matrix factorization of temporal trends from a multivariate panel of physiological measurements

Affiliations

Unsupervised phenotyping of sepsis using nonnegative matrix factorization of temporal trends from a multivariate panel of physiological measurements

Menghan Ding et al. BMC Med Inform Decis Mak. .

Abstract

Background: Sepsis is a highly lethal and heterogeneous disease. Utilization of an unsupervised method may identify novel clinical phenotypes that lead to targeted therapies and improved care.

Methods: Our objective was to derive clinically relevant sepsis phenotypes from a multivariate panel of physiological data using subgraph-augmented nonnegative matrix factorization. We utilized data from the Medical Information Mart for Intensive Care III database of patients who were admitted to the intensive care unit with sepsis. The extracted data contained patient demographics, physiological records, sequential organ failure assessment scores, and comorbidities. We applied frequent subgraph mining to extract subgraphs from physiological time series and performed nonnegative matrix factorization over the subgraphs to derive patient clusters as phenotypes. Finally, we profiled these phenotypes based on demographics, physiological patterns, disease trajectories, comorbidities and outcomes, and performed functional validation of their clinical implications.

Results: We analyzed a cohort of 5782 patients, derived three novel phenotypes of distinct clinical characteristics and demonstrated their prognostic implications on patient outcome. Subgroup 1 included relatively less severe/deadly patients (30-day mortality, 17%) and was the smallest-in-size group (n = 1218, 21%). It was characterized by old age (mean age, 73 years), a male majority (male-to-female ratio, 59-to-41), and complex chronic conditions. Subgroup 2 included the most severe/deadliest patients (30-day mortality, 28%) and was the second-in-size group (n = 2036, 35%). It was characterized by a male majority (male-to-female ratio, 60-to-40), severe organ dysfunction or failure compounded by a wide range of comorbidities, and uniquely high incidences of coagulopathy and liver disease. Subgroup 3 included the least severe/deadly patients (30-day mortality, 10%) and was the largest group (n = 2528, 44%). It was characterized by low age (mean age, 60 years), a balanced gender ratio (male-to-female ratio, 50-to-50), the least complicated conditions, and a uniquely high incidence of neurologic disease. These phenotypes were validated to be prognostic factors of mortality for sepsis patients.

Conclusions: Our results suggest that these phenotypes can be used to develop targeted therapies based on phenotypic heterogeneity and algorithms designed for monitoring, validating and intervening clinical decisions for sepsis patients.

Keywords: Clustering; Frequent subgraph mining; Gradient boosting machine; Intensive care unit; Nonnegative matrix factorization; Phenotyping; Physiological measurements; Sepsis; Unsupervised learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Subgroup 1 trend group selected from representative frequent subgraphs of standardized physiological variable values over measurement period of six time windows
Fig. 2
Fig. 2
Subgroup 2 trend group selected from representative frequent subgraphs of standardized physiological variable values over measurement period of six time windows
Fig. 3
Fig. 3
Subgroup 3 trend group selected from representative frequent subgraphs of standardized physiological variable values over measurement period of six time windows
Fig. 4
Fig. 4
7-Day sofa score charts
Fig. 5
Fig. 5
Subgraph extraction from time series graph
Fig. 6
Fig. 6
Frequent subgraph mining from subgraph corpus
Fig. 7
Fig. 7
Flow diagram for NMF model construction, Sepsis-3 subgroup identification, and associated functional validations
Fig. 8
Fig. 8
Gradient boosting machine ROC curve for patient group membership classification on frequent subgraphs—subgroup 1
Fig. 9
Fig. 9
Gradient boosting machine ROC curve for patient group membership classification on frequent subgraphs—subgroup 2
Fig. 10
Fig. 10
Gradient boosting machine ROC curve for patient group membership classification on frequent subgraphs—subgroup 3
Fig. 11
Fig. 11
Gradient boosting machine ROC curve for patient mortality model on frequent subgraphs

Similar articles

Cited by

References

    1. Angus DC, van der Poll T. Severe sepsis and septic shock. N Engl J Med. 2013;369(9):840–851. doi: 10.1056/NEJMra1208623. - DOI - PubMed
    1. Rangel-Frausto MS, Pittet D, Costigan M, Hwang T, Davis CS, Wenzel RP. The natural history of the systemic inflammatory response syndrome (SIRS). A prospective study. JAMA. 1995;273(2):117–123. doi: 10.1001/jama.1995.03520260039030. - DOI - PubMed
    1. Angus DC, Linde-Zwirble WT, Lidicker J, Clermont G, Carcillo J, Pinsky MR. Epidemiology of severe sepsis in the United States: analysis of incidence, outcome, and associated costs of care. Crit Care Med. 2001;29(7):1303–1310. doi: 10.1097/00003246-200107000-00002. - DOI - PubMed
    1. Parrillo JE, Parker MM, Natanson C, Suffredini AF, Danner RL, Cunnion RE, Ognibene FP. Septic shock in humans. Advances in the understanding of pathogenesis, cardiovascular dysfunction, and therapy. Ann Intern Med. 1990;113(3):227–242. doi: 10.7326/0003-4819-113-3-227. - DOI - PubMed
    1. Rhee C, Dantes R, Epstein L, Murphy DJ, Seymour CW. Incidence and trends of sepsis in US hospitals using clinical vs claims data, 2009–2014. JAMA. 2017;318(13):1241–1249. doi: 10.1001/jama.2017.13836. - DOI - PMC - PubMed

Publication types