Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr:62:25-30.
doi: 10.1016/j.jcrc.2020.10.033. Epub 2020 Nov 16.

Development of a machine learning algorithm to predict intubation among hospitalized patients with COVID-19

Affiliations

Development of a machine learning algorithm to predict intubation among hospitalized patients with COVID-19

Varun Arvind et al. J Crit Care. 2021 Apr.

Abstract

Purpose: The purpose of this study is to develop a machine learning algorithm to predict future intubation among patients diagnosed or suspected with COVID-19.

Materials and methods: This is a retrospective cohort study of patients diagnosed or under investigation for COVID-19. A machine learning algorithm was trained to predict future presence of intubation based on prior vitals, laboratory, and demographic data. Model performance was compared to ROX index, a validated prognostic tool for prediction of mechanical ventilation.

Results: 4087 patients admitted to five hospitals between February 2020 and April 2020 were included. 11.03% of patients were intubated. The machine learning model outperformed the ROX-index, demonstrating an area under the receiver characteristic curve (AUC) of 0.84 and 0.64, and area under the precision-recall curve (AUPRC) of 0.30 and 0.13, respectively. In the Kaplan-Meier analysis, patients alerted by the model were more likely to require intubation during their admission (p < 0.0001).

Conclusion: In patients diagnosed or under investigation for COVID-19, machine learning can be used to predict future risk of intubation based on clinical data which are routinely collected and available in clinical setting. Such an approach may facilitate identification of high-risk patients to assist in clinical care.

Keywords: COVID-19; Intubation; Machine learning; Prediction; Respiratory distress.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors have no conflicts of interest to disclose.

Figures

Fig. 1
Fig. 1
Time course of a subset of laboratory and vitals data for a patient from time of admission. Example data displayed include partial pressure of arterial oxygen (PAO2), partial pressure of arterial CO2 (PACO2), oxygen saturation (O2 SAT), Creatinine, Platelet count, white blood cell count (WBC), Temperature, Pulse, respiratory rate (RR), systolic blood pressure (Systolic), and Diastolic blood pressure (diastolic). Model predicted risk (blue, increased risk with increasing value), and ROX-index (green, increased risk with decreasing value), are displayed. Red line indicates the time of intubation. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2
Fig. 2
Receiver operating characteristic (ROC) and precision-recall curves for the machine learning model (blue) and the ROX score (green). Dashed line for the ROC curve indicates random chance. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3
Fig. 3
Kaplan-Meier plot showing the cumulative probability of remaining free of intubation from time of admission among patients alerted and not-alerted by the model. P-value indicates the result of a log-rank test.
Fig. 4
Fig. 4
(A) Performance metrics AUC, AUPR, and average precision across various cohorts from the hold-out dataset. (B) Performance metrics stratified by sex, BMI ≥ 35, and hospital location.
Supplemental Fig. 1
Supplemental Fig. 1
Schematic of model framework. Using a sampling window of 24-h of vitals, laboratory, and demographics data, the model predicts presence of intubation 72 h from the end of the sampling window, designated as the forecast window. Predictions are made by sliding the sampling window every 12-h, and forecasting predictions 72-h from the end of the respective sampling window.
Supplemental Fig. 2
Supplemental Fig. 2
(A) Mean value of the decision criteria for time-series data used by the random forest model for forecasting predictions across all decision trees ± one standard deviation. (B) Rank-order of time-series and static variables feature importance (standardized score), with greater values indicating increased weight when making predictions by the model. Percent missing of each feature prior to forward-filling.

References

    1. Singh D., Kumar V., Vaishali K.M. Classification of COVID-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks. Eur J Clin Microbiol Infect Dis. 2020 doi: 10.1007/s10096-020-03901-z. - DOI - PMC - PubMed
    1. Sharma S. Drawing Insights from COVID-19 Infected Patients With no Past Medical History Using CT Scan Images and Machine Learning Techniques: A Study on 200 Patients n.d. . - DOI - PMC - PubMed
    1. Liu Z., Huang S., Lu W., Su Z., Yin X., Liang H., et al. Modeling the trend of coronavirus disease 2019 and restoration of operational capability of metropolitan medical service in China: A machine learning and mathematical model-based analysis. Glob Health Res Policy. 2020;5:20. - PMC - PubMed
    1. Vaid S., Cakan C., Bhandari M. Using machine learning to estimate unobserved COVID-19 infections in North America. J Bone Joint Surg Am. 2020 doi: 10.2106/JBJS.20.00715. - DOI - PMC - PubMed
    1. Chen X, Liu Z. Early prediction of mortality risk among severe COVID-19 patients using machine learning n.d. 10.1101/2020.04.13.20064329. - DOI - PMC - PubMed