Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Arne Peine^#¹, Ahmed Hallawa^#^{1

2}, Johannes Bickenbach¹, Guido Dartmann³, Lejla Begic Fazlic³, Anke Schmeink⁴, Gerd Ascheid², Christoph Thiemermann⁵, Andreas Schuppert⁶, Ryan Kindle^{7

8}, Leo Celi^{7

8

9}, Gernot Marx¹, Lukas Martin¹⁰

Affiliations

¹ Department of Intensive Care and Intermediate Care, University Hospital RWTH Aachen, Pauwelsstreet 30, Aachen, Germany.
² Chair for Integrated Signal Processing Systems, RWTH Aachen University, Kopernikusstreet 16, Aachen, Germany.
³ Environmental Campus Birkenfeld, Trier University of Applied Sciences, Schneidershof, Trier, Germany.
⁴ Research Area Information Theory and Systematic Design of Communication Systems, RWTH Aachen University, Kopernikusstreet 16, Aachen, Germany.
⁵ William Harvey Research Institute, Queen Mary University London, Charterhouse Square, London, United Kingdom.
⁶ Joint Research Center for Computational Biomedicine, RWTH Aachen University, Pauwelsstreet 30, Aachen, Germany.
⁷ Laboratory for Computational Physiology, Harvard-MIT Division of Health Sciences & Technology, Cambridge, MA, USA.
⁸ Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
⁹ Department of Biostatistics Harvard T.H, Chan School of Public Health, Boston, MA, USA.
¹⁰ Department of Intensive Care and Intermediate Care, University Hospital RWTH Aachen, Pauwelsstreet 30, Aachen, Germany. lmartin@ukaachen.de.

^# Contributed equally.

PMID: 33608661
PMCID: PMC7895944
DOI: 10.1038/s41746-021-00388-6

Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Arne Peine et al. NPJ Digit Med. 2021.

. 2021 Feb 19;4(1):32.

doi: 10.1038/s41746-021-00388-6.

Authors

Affiliations

¹ Department of Intensive Care and Intermediate Care, University Hospital RWTH Aachen, Pauwelsstreet 30, Aachen, Germany.
² Chair for Integrated Signal Processing Systems, RWTH Aachen University, Kopernikusstreet 16, Aachen, Germany.
³ Environmental Campus Birkenfeld, Trier University of Applied Sciences, Schneidershof, Trier, Germany.
⁴ Research Area Information Theory and Systematic Design of Communication Systems, RWTH Aachen University, Kopernikusstreet 16, Aachen, Germany.
⁵ William Harvey Research Institute, Queen Mary University London, Charterhouse Square, London, United Kingdom.
⁶ Joint Research Center for Computational Biomedicine, RWTH Aachen University, Pauwelsstreet 30, Aachen, Germany.
⁷ Laboratory for Computational Physiology, Harvard-MIT Division of Health Sciences & Technology, Cambridge, MA, USA.
⁸ Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
⁹ Department of Biostatistics Harvard T.H, Chan School of Public Health, Boston, MA, USA.
¹⁰ Department of Intensive Care and Intermediate Care, University Hospital RWTH Aachen, Pauwelsstreet 30, Aachen, Germany. lmartin@ukaachen.de.

^# Contributed equally.

PMID: 33608661
PMCID: PMC7895944
DOI: 10.1038/s41746-021-00388-6

Abstract

The aim of this work was to develop and evaluate the reinforcement learning algorithm VentAI, which is able to suggest a dynamically optimized mechanical ventilation regime for critically-ill patients. We built, validated and tested its performance on 11,943 events of volume-controlled mechanical ventilation derived from 61,532 distinct ICU admissions and tested it on an independent, secondary dataset (200,859 ICU stays; 25,086 mechanical ventilation events). A patient "data fingerprint" of 44 features was extracted as multidimensional time series in 4-hour time steps. We used a Markov decision process, including a reward system and a Q-learning approach, to find the optimized settings for positive end-expiratory pressure (PEEP), fraction of inspired oxygen (FiO₂) and ideal body weight-adjusted tidal volume (Vt). The observed outcome was in-hospital or 90-day mortality. VentAI reached a significantly increased estimated performance return of 83.3 (primary dataset) and 84.1 (secondary dataset) compared to physicians' standard clinical care (51.1). The number of recommended action changes per mechanically ventilated patient constantly exceeded those of the clinicians. VentAI chose 202.9% more frequently ventilation regimes with lower Vt (5-7.5 mL/kg), but 50.8% less for regimes with higher Vt (7.5-10 mL/kg). VentAI recommended 29.3% more frequently PEEP levels of 5-7 cm H₂O and 53.6% more frequently PEEP levels of 7-9 cmH₂O. VentAI avoided high (>55%) FiO₂ values (59.8% decrease), while preferring the range of 50-55% (140.3% increase). In conclusion, VentAI provides reproducible high performance by dynamically choosing an optimized, individualized ventilation strategy and thus might be of benefit for critically ill patients.

PubMed Disclaimer

Conflict of interest statement

A.P., G.D., A.S., C.T., G.M., and L.M. are co-founders of Clinomic GmbH. A.P. and L.M. are chief executive officers of Clinomic GmbH. C.T. is chief executive officer of William Harvey Research Limited outside of the submitted work. G.M. received restricted research grants and consultancy fees from BBraun Melsungen, Biotest, Adrenomed, and Sphingotec GmbH outside of the submitted work. L.M. and A.P. received consultancy fees from Sphingotec GmbH. All remaining authors declare that they have no conflict of interests.

Figures

**Fig. 1. VentAI Data Routine.**
Flow diagram of the overall cohort, architectural overview of the VentAI algorithm and independent testing on eICU dataset.

**Fig. 2. VentAI Performance.**
a VentAI estimated performance return on both datasets (MIMIC-III and eICU) versus clinicians’ performance return with variance in MIMIC-III dataset after the exposure of the policies to 500 models. b Relation between VentAI performance return and estimated 90-day mortality risk in the MIMIC-III dataset. c Relation between VentAI performance return and in-hospital mortality risk in the eICU dataset.

**Fig. 3. Visualization of the action distribution in the 3-dimensional action space (MIMIC-III dataset).**
The test set includes 36,225 decision time instances and the designed model facilitates 343 action bins in the action space.

**Fig. 4. Number of action changes (MIMIC-III dataset).**
The relative number of action changes (ideal body weight-adjusted tidal volume (Vt), positive end expiratory pressure (PEEP), and fraction of inspired oxygen (FiO₂)) is shown in relation to the number of mechanically ventilated patients at each 4 h time step. Clinicians action changes are shown in blue while the VentAI action changes are shown in red.

**Fig. 5. Visualization of two representative patient cases (MIMIC-III dataset).**
Visualization of two representative case studies in 4-hour intervals. Both patients died within the observed 90 days. Clinicians’ actions are shown in blue while the VentAI actions are shown in red.

**Fig. 6. Out-of-Bag feature weight analysis of VentAI (MIMIC-III dataset).**
Relative weight of each feature using out-of-bag feature weight analysis, based on the relative loss of prediction, represented by an increase of the mean squared error. a Ideal body weight-adjusted tidal volume (mL/kg). b PEEP (cmH₂0). c FiO₂ (%).

See this image and copyright information in PMC

References

1. Zampieri FG, Mazza B. Mechanical ventilation in sepsis: a reappraisal. Shock. 2017;47:41–46. doi: 10.1097/SHK.0000000000000702. - DOI - PubMed
1. Writing Group for the PReVENT Investigators et al. Effect of a low vs intermediate tidal volume strategy on ventilator-free days in intensive care unit patients without ARDS: a randomized clinical trial. JAMA. 2018;320:1872–1880. doi: 10.1001/jama.2018.14280. - DOI - PMC - PubMed
1. Slutsky AS, Ranieri VM. Ventilator-induced lung injury. N. Engl. J. Med. 2013;369:2126–2136. doi: 10.1056/NEJMra1208707. - DOI - PubMed
1. Serpa Neto A, et al. Protective versus conventional ventilation for surgery: a systematic review and individual patient data meta-analysis. Anesthesiology. 2015;123:66–78. doi: 10.1097/ALN.0000000000000706. - DOI - PubMed
1. Gattinoni L, et al. The future of mechanical ventilation: lessons from the present and the past. Crit. Care Lond. Engl. 2017;21:183. doi: 10.1186/s13054-017-1750-x. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Affiliations

Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources