Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct 1;202(7):996-1004.
doi: 10.1164/rccm.202002-0347OC.

Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data

Affiliations

Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data

Pratik Sinha et al. Am J Respir Crit Care Med. .

Abstract

Rationale: Two distinct phenotypes of acute respiratory distress syndrome (ARDS) with differential clinical outcomes and responses to randomly assigned treatment have consistently been identified in randomized controlled trial cohorts using latent class analysis. Plasma biomarkers, key components in phenotype identification, currently lack point-of-care assays and represent a barrier to the clinical implementation of phenotypes.Objectives: The objective of this study was to develop models to classify ARDS phenotypes using readily available clinical data only.Methods: Three randomized controlled trial cohorts served as the training data set (ARMA [High vs. Low Vt], ALVEOLI [Assessment of Low Vt and Elevated End-Expiratory Pressure to Obviate Lung Injury], and FACTT [Fluids and Catheter Treatment Trial]; n = 2,022), and a fourth served as the validation data set (SAILS [Statins for Acutely Injured Lungs from Sepsis]; n = 745). A gradient-boosted machine algorithm was used to develop classifier models using 24 variables (demographics, vital signs, laboratory, and respiratory variables) at enrollment. In two secondary analyses, the ALVEOLI and FACTT cohorts each, individually, served as the validation data set, and the remaining combined cohorts formed the training data set for each analysis. Model performance was evaluated against the latent class analysis-derived phenotype.Measurements and Main Results: For the primary analysis, the model accurately classified the phenotypes in the validation cohort (area under the receiver operating characteristic curve [AUC], 0.95; 95% confidence interval [CI], 0.94-0.96). Using a probability cutoff of 0.5 to assign class, inflammatory biomarkers (IL-6, IL-8, and sTNFR-1; P < 0.0001) and 90-day mortality (38% vs. 24%; P = 0.0002) were significantly higher in the hyperinflammatory phenotype as classified by the model. Model accuracy was similar when ALVEOLI (AUC, 0.94; 95% CI, 0.92-0.96) and FACTT (AUC, 0.94; 95% CI, 0.92-0.95) were used as the validation cohorts. Significant treatment interactions were observed with the clinical classifier model-assigned phenotypes in both ALVEOLI (P = 0.0113) and FACTT (P = 0.0072) cohorts.Conclusions: ARDS phenotypes can be accurately identified using machine learning models based on readily available clinical data and may enable rapid phenotype identification at the bedside.

Keywords: ARDS phenotypes; classifier models; machine learning.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A schematic of the analysis plan and the data sets used in the primary, secondary, and sparse variable set analyses. ALVEOLI = Assessment of Low Vt and Elevated End-Expiratory Pressure to Obviate Lung Injury; ARMA = High vs. Low Vt; FACTT = Fluids and Catheter Treatment Trial; SAILS = Statins for Acutely Injured Lungs from Sepsis.
Figure 2.
Figure 2.
Differences in the plasma biomarker levels in the validation data set (SAILS [Statins for Acutely Injured Lungs from Sepsis] trial) at baseline in the hypoinflammatory and hyperinflammatory phenotypes as identified by the clinical classifier model developed in the primary analysis. P values represent the Wilcoxon rank sum test. (A) IL-6; y-axis upper limit is restricted to 10,000 with 19 observations censored (15 hyperinflammatory and 4 hypoinflammatory). (B) IL-8; y-axis upper limit is restricted to 1,000 with 31 observations censored (24 hyperinflammatory and 7 hypoinflammatory). (C) Protein C. (D) Soluble tumor necrosis factor receptor 1; y-axis upper limit is restricted to 40,000 with three observations censored (all hyperinflammatory). TNF = tumor necrosis factor.
Figure 3.
Figure 3.
Top 10 most important variables in the training data set in the primary analysis. Importance was scaled to 100; 100 represents the most important predictor variable, and a decreasing value represents diminishing importance. BP = blood pressure.
Figure 4.
Figure 4.
Receiver operating characteristic curves of the four grouped sparse variable set classifier models in the validation cohort (SAILS [Statins for Acutely Injured Lungs from Sepsis] trial) of the primary analysis. For each model, the area under the curve is presented in the legend box. AUC = area under the curve; ROC = receiver operating characteristic.

Comment in

Similar articles

Cited by

References

    1. Bellani G, Laffey JG, Pham T, Fan E, Brochard L, Esteban A, et al. LUNG SAFE Investigators; ESICM Trials Group. Epidemiology, patterns of care, and mortality for patients with acute respiratory distress syndrome in intensive care units in 50 countries. JAMA. 2016;315:788–800. - PubMed
    1. Kumar G, Kumar N, Taneja A, Kaleekal T, Tarima S, McGinley E, et al. Milwaukee Initiative in Critical Care Outcomes Research (MICCOR) Group of Investigators. Nationwide trends of severe sepsis in the 21st century (2000-2007) Chest. 2011;140:1223–1231. - PubMed
    1. Matthay MA, McAuley DF, Ware LB. Clinical trials in acute respiratory distress syndrome: challenges and opportunities. Lancet Respir Med. 2017;5:524–534. - PubMed
    1. Marshall JC. Why have clinical trials in sepsis failed? Trends Mol Med. 2014;20:195–203. - PubMed
    1. Sinha P, Calfee CS. Phenotypes in acute respiratory distress syndrome: moving towards precision medicine. Curr Opin Crit Care. 2019;25:12–20. - PMC - PubMed

Publication types

MeSH terms