Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 7;29(1):101.
doi: 10.1186/s13054-025-05336-4.

Machine learning models compared with current clinical indices to predict the outcome of high flow nasal cannula therapy in acute hypoxemic respiratory failure

Affiliations

Machine learning models compared with current clinical indices to predict the outcome of high flow nasal cannula therapy in acute hypoxemic respiratory failure

Hang Yu et al. Crit Care. .

Abstract

Background: Early identification of patients with acute hypoxemic respiratory failure (AHRF) who are at risk of failing high-flow nasal cannula (HFNC) therapy could facilitate closer monitoring, and timely adjustment/escalation of treatment. We aimed to establish whether machine learning (ML) models could predict HFNC outcome, early in the course of treatment, with greater accuracy than currently used clinical indices.

Methods: We developed ML models trained using measurements made within the first 2 h of treatment from 184 AHRF patients (37% HFNC failures) treated at the respiratory ICU of the University Hospital of Modena between 2018 and 2023. For external validation, we used a dataset on 567 AHRF patients (22% failures) comprising 510 patients from the recent RENOVATE trial in Brazil and 57 from the MIMIC-IV and eICU databases in the US. Predictive performance of the ML models was benchmarked against optimized thresholds of the following clinical indices: respiratory rate oxygenation index (ROX) and variants, heart rate to saturation of pulse oxygen (SpO2) ratio, SpO2/FiO2 ratio, PaO2/FiO2 ratio, sequential organ failure assessment and heart rate, acidosis, consciousness, oxygenation and respiratory rate scores.

Results: Internal and external predictive performance of a Support Vector Machine (SVM) ML model was superior to all clinical indices across all scenarios tested. In external validation on the 567-patient dataset, a SVM model trained on non-invasive measurements had an accuracy of 73%, sensitivity of 73%, specificity of 73%, and AUC of 0.79. The ROX index had an accuracy of 64%, sensitivity of 79%, specificity of 60%, and AUC of 0.74. When arterial blood gasses (ABG's) were also used for model training, the SVM model had an accuracy of 83%, sensitivity of 84%, specificity of 82%, and AUC of 0.82 in external validation on the MIMIC-IV/eICU dataset. The modified ROX index, which requires PaO2, achieved 70% accuracy, 63% sensitivity, 74% specificity, and AUC of 0.65.

Conclusions: Decision support tools based on SVM models could provide clinicians with more accurate early predictions of HFNC outcome than currently available clinical indices. If available, ABG measurements could improve the capability to accurately identify patients at risk of failing HFNC therapy.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not Applicable. Consent for publication: Not Applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow chart of the data extraction process from RENOVATE and MIMIC-IV/eICU Databases. a Study cohort extracted from RENOVATE study b Study cohort extracted from eICU/MIMIC-IV database
Fig. 2
Fig. 2
Process for model development, validation, performance analysis, and deployment
Fig. 3
Fig. 3
Graphical comparison of the performance of both machine learning models with the ROX index and its variants, on the external validation cohort. a, b The Receiver Operating Characteristic (ROC) curve comparing the performance of various machine learning models and the ROX index, along with its variants, on the external validation cohort. c, d Decision curve analysis (DCA) for SVM model and ROX with its variants on the external validation cohort. The curves demonstrate the net benefit across a range of threshold probabilities, comparing the predictive performance of each model against the ‘Treat-All’ (escalate oxygenation treatment for all patients) and ‘Treat-None’ (do not escalate oxygenation treatment) strategies. The shaded regions represent the range of net benefit for SVM. Net benefit = (true positives/total patients)—(false positives/total patients) × ω, where ω is the odds at the threshold probability. Left Panels: ROC and DCA curves show the performance of a SVM model without using ABG measurements—because such measurements were unavailable in the RENOVATE dataset at matching time points—along with ROX indices. The evaluation was conducted on the combined eICU, MIMIC‑IV, and RENOVATE dataset, which includes 567 patients (440 successes and 127 failures). Right Panels: ROC and DCA curves comparing the performance of a SVM model with all measurements, a SVM model without using ABG measurements, and ROX indices with variants on the eICU + MIMIC-IV dataset (57 patients: 38 successes, 19 failures)
Fig. 4
Fig. 4
SVM model explainabilty analysis: SHAP summary plot and permutation importance plot. a, c Average SHAP values obtained from 500 iterations of repeated fivefold cross-validation on internal dataset. Horizontal axis: The impact of each feature on the model’s prediction. Positive SHAP values indicate that the feature contributes to predicting a HFNC failure, while negative SHAP values indicate a contribution to predicting HFNC success. Vertical axis: The list of features used for making a prediction, ordered based on their importance, with the most important features at the top. Dots: Each dot represents a single patient in the dataset. Colour gradient: Indicates the feature value for each observation, where blue represents low feature values and red represents high feature values. b, d Permutation importance for external validation obtained by SVM model on external dataset. The suffix “_diff” indicates the change in a specific variable between two time points. For instance, “FiO2_diff” represents the change in FiO2 values between T1 and T0 (FiO2_diff = FiO2_T1—FiO2_T0)

References

    1. Frat JP, Thille AW, Mercat A, Girault C, Ragot S, Perbet S, et al. High-flow oxygen through nasal cannula in acute hypoxemic respiratory failure. N Engl J Med. 2015;372:2185–96. - PubMed
    1. Frat JP, Ragot S, Girault C, Perbet S, Prat G, Boulain T, et al. REVA network effect of non-invasive oxygenation strategies in immunocompromised patients with severe acute respiratory failure: a post-hoc analysis of a randomised trial. Lancet Respir Med. 2016;4:646–52. - PubMed
    1. Oczkowski S, Ergan B, Bos L, et al. ERS clinical practice guidelines: high-fow nasal cannula in acute respiratory failure. Eur Respir J. 2022;59(4):2101574. - PubMed
    1. Evans L, Rhodes A, Alhazzani W, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock. Intensive Care Med. 2021;47(11):1181–247. - PMC - PubMed
    1. Spinelli E, Mauri T, Beitler JR, Pesenti A, Brodie D. Respiratory drive in the acute respiratory distress syndrome: pathophysiology, monitoring, and therapeutic interventions. Int Care Med. 2020;46(4):606–18. - PMC - PubMed

MeSH terms

LinkOut - more resources