Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 19;4(1):32.
doi: 10.1038/s41746-021-00388-6.

Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Affiliations

Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Arne Peine et al. NPJ Digit Med. .

Abstract

The aim of this work was to develop and evaluate the reinforcement learning algorithm VentAI, which is able to suggest a dynamically optimized mechanical ventilation regime for critically-ill patients. We built, validated and tested its performance on 11,943 events of volume-controlled mechanical ventilation derived from 61,532 distinct ICU admissions and tested it on an independent, secondary dataset (200,859 ICU stays; 25,086 mechanical ventilation events). A patient "data fingerprint" of 44 features was extracted as multidimensional time series in 4-hour time steps. We used a Markov decision process, including a reward system and a Q-learning approach, to find the optimized settings for positive end-expiratory pressure (PEEP), fraction of inspired oxygen (FiO2) and ideal body weight-adjusted tidal volume (Vt). The observed outcome was in-hospital or 90-day mortality. VentAI reached a significantly increased estimated performance return of 83.3 (primary dataset) and 84.1 (secondary dataset) compared to physicians' standard clinical care (51.1). The number of recommended action changes per mechanically ventilated patient constantly exceeded those of the clinicians. VentAI chose 202.9% more frequently ventilation regimes with lower Vt (5-7.5 mL/kg), but 50.8% less for regimes with higher Vt (7.5-10 mL/kg). VentAI recommended 29.3% more frequently PEEP levels of 5-7 cm H2O and 53.6% more frequently PEEP levels of 7-9 cmH2O. VentAI avoided high (>55%) FiO2 values (59.8% decrease), while preferring the range of 50-55% (140.3% increase). In conclusion, VentAI provides reproducible high performance by dynamically choosing an optimized, individualized ventilation strategy and thus might be of benefit for critically ill patients.

PubMed Disclaimer

Conflict of interest statement

A.P., G.D., A.S., C.T., G.M., and L.M. are co-founders of Clinomic GmbH. A.P. and L.M. are chief executive officers of Clinomic GmbH. C.T. is chief executive officer of William Harvey Research Limited outside of the submitted work. G.M. received restricted research grants and consultancy fees from BBraun Melsungen, Biotest, Adrenomed, and Sphingotec GmbH outside of the submitted work. L.M. and A.P. received consultancy fees from Sphingotec GmbH. All remaining authors declare that they have no conflict of interests.

Figures

Fig. 1
Fig. 1. VentAI Data Routine.
Flow diagram of the overall cohort, architectural overview of the VentAI algorithm and independent testing on eICU dataset.
Fig. 2
Fig. 2. VentAI Performance.
a VentAI estimated performance return on both datasets (MIMIC-III and eICU) versus clinicians’ performance return with variance in MIMIC-III dataset after the exposure of the policies to 500 models. b Relation between VentAI performance return and estimated 90-day mortality risk in the MIMIC-III dataset. c Relation between VentAI performance return and in-hospital mortality risk in the eICU dataset.
Fig. 3
Fig. 3. Visualization of the action distribution in the 3-dimensional action space (MIMIC-III dataset).
The test set includes 36,225 decision time instances and the designed model facilitates 343 action bins in the action space.
Fig. 4
Fig. 4. Number of action changes (MIMIC-III dataset).
The relative number of action changes (ideal body weight-adjusted tidal volume (Vt), positive end expiratory pressure (PEEP), and fraction of inspired oxygen (FiO2)) is shown in relation to the number of mechanically ventilated patients at each 4 h time step. Clinicians action changes are shown in blue while the VentAI action changes are shown in red.
Fig. 5
Fig. 5. Visualization of two representative patient cases (MIMIC-III dataset).
Visualization of two representative case studies in 4-hour intervals. Both patients died within the observed 90 days. Clinicians’ actions are shown in blue while the VentAI actions are shown in red.
Fig. 6
Fig. 6. Out-of-Bag feature weight analysis of VentAI (MIMIC-III dataset).
Relative weight of each feature using out-of-bag feature weight analysis, based on the relative loss of prediction, represented by an increase of the mean squared error. a Ideal body weight-adjusted tidal volume (mL/kg). b PEEP (cmH20). c FiO2 (%).

References

    1. Zampieri FG, Mazza B. Mechanical ventilation in sepsis: a reappraisal. Shock. 2017;47:41–46. doi: 10.1097/SHK.0000000000000702. - DOI - PubMed
    1. Writing Group for the PReVENT Investigators et al. Effect of a low vs intermediate tidal volume strategy on ventilator-free days in intensive care unit patients without ARDS: a randomized clinical trial. JAMA. 2018;320:1872–1880. doi: 10.1001/jama.2018.14280. - DOI - PMC - PubMed
    1. Slutsky AS, Ranieri VM. Ventilator-induced lung injury. N. Engl. J. Med. 2013;369:2126–2136. doi: 10.1056/NEJMra1208707. - DOI - PubMed
    1. Serpa Neto A, et al. Protective versus conventional ventilation for surgery: a systematic review and individual patient data meta-analysis. Anesthesiology. 2015;123:66–78. doi: 10.1097/ALN.0000000000000706. - DOI - PubMed
    1. Gattinoni L, et al. The future of mechanical ventilation: lessons from the present and the past. Crit. Care Lond. Engl. 2017;21:183. doi: 10.1186/s13054-017-1750-x. - DOI - PMC - PubMed

LinkOut - more resources