Multicenter Study

. 2025 May 20;333(19):1688-1698.

doi: 10.1001/jama.2025.3046.

Optimal Vasopressin Initiation in Septic Shock: The OVISS Reinforcement Learning Study

Alexandre Kalimouttou^{1

2

3}, Jason N Kennedy^{4

5}, Jean Feng⁶, Harvineet Singh⁶, Suchi Saria^{7

8

9

10

11}, Derek C Angus^{4

5}, Christopher W Seymour^{4

5}, Romain Pirracchio^{1

2

6}

Affiliations

¹ Inserm UMR 1153, Centre for Research in Epidemiology and Statistics (CRESS), ECSTRRA Team, Université Paris Cité and Université Sorbonne Paris Nord, Paris, France.
² Department of Anesthesia & Perioperative Care, Zuckerberg San Francisco General Hospital and Trauma Center, University of California, San Francisco.
³ Department of Anesthesiology and Intensive Care Medicine, Grenoble Alpes University Hospital, Grenoble, France.
⁴ Department of Critical Care Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania.
⁵ Center for Research, Investigation, and Systems Modeling of Acute Illness, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania.
⁶ Department of Epidemiology & Biostatistics, University of California, San Francisco.
⁷ Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland.
⁸ Department of Computer Science, Johns Hopkins University, Baltimore, Maryland.
⁹ Department of Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland.
¹⁰ Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland.
¹¹ Bayesian Health, New York, New York.

PMID: 40098600
PMCID: PMC11920879
DOI: 10.1001/jama.2025.3046

Multicenter Study

Optimal Vasopressin Initiation in Septic Shock: The OVISS Reinforcement Learning Study

Alexandre Kalimouttou et al. JAMA. 2025.

. 2025 May 20;333(19):1688-1698.

doi: 10.1001/jama.2025.3046.

Authors

Affiliations

¹ Inserm UMR 1153, Centre for Research in Epidemiology and Statistics (CRESS), ECSTRRA Team, Université Paris Cité and Université Sorbonne Paris Nord, Paris, France.
² Department of Anesthesia & Perioperative Care, Zuckerberg San Francisco General Hospital and Trauma Center, University of California, San Francisco.
³ Department of Anesthesiology and Intensive Care Medicine, Grenoble Alpes University Hospital, Grenoble, France.
⁴ Department of Critical Care Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania.
⁵ Center for Research, Investigation, and Systems Modeling of Acute Illness, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania.
⁶ Department of Epidemiology & Biostatistics, University of California, San Francisco.
⁷ Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland.
⁸ Department of Computer Science, Johns Hopkins University, Baltimore, Maryland.
⁹ Department of Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland.
¹⁰ Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland.
¹¹ Bayesian Health, New York, New York.

PMID: 40098600
PMCID: PMC11920879
DOI: 10.1001/jama.2025.3046

Erratum in

Errors in Text.
[No authors listed] [No authors listed] JAMA. 2025 May 6;333(17):1549. doi: 10.1001/jama.2025.5041. JAMA. 2025. PMID: 40193113 Free PMC article. No abstract available.

Abstract

Importance: Norepinephrine is the first-line vasopressor for patients with septic shock. When and whether a second agent, such as vasopressin, should be added is unknown.

Objective: To derive and validate a reinforcement learning model to determine the optimal initiation rule for vasopressin in adult, critically ill patients receiving norepinephrine for septic shock.

Design, setting, and participants: Reinforcement learning was used to generate the optimal rule for vasopressin initiation to improve short-term and hospital outcomes, using electronic health record data from 3608 patients who met the Sepsis-3 shock criteria at 5 California hospitals from 2012 to 2023. The rule was evaluated in 628 patients from the California dataset and 3 external datasets comprising 10 217 patients from 227 US hospitals, using weighted importance sampling and pooled logistic regression with inverse probability weighting.

Exposures: Clinical, laboratory, and treatment variables grouped hourly for 120 hours in the electronic health record.

Main outcome and measure: The primary outcome was in-hospital mortality.

Results: The derivation cohort (n = 3608) included 2075 men (57%) and had a median (IQR) age of 63 (56-70) years and Sequential Organ Failure Assessment (SOFA) score at shock onset of 5 (3-7 [range, 0-24, with higher scores associated with greater mortality]). The validation cohorts (n = 10 217) were 56% male (n = 5743) with a median (IQR) age of 67 (57-75) years and a SOFA score of 6 (4-9). In validation data, the model suggested vasopressin initiation in more patients (87% vs 31%), earlier relative to shock onset (median [IQR], 4 [1-8] vs 5 [1-14] hours), and at lower norepinephrine doses (median [IQR], 0.20 [0.08-0.45] vs 0.37 [0.17-0.69] µg/kg/min) compared with clinicians' actions. The rule was associated with a larger expected reward in validation data compared with clinician actions (weighted importance sampling difference, 31 [95% CI, 15-52]). The adjusted odds of hospital mortality were lower if vasopressin initiation was similar to the rule compared with different (odds ratio, 0.81 [95% CI, 0.73-0.91]), a finding consistent across external validation sets.

Conclusions and relevance: In adult patients with septic shock receiving norepinephrine, the use of vasopressin was variable. A reinforcement learning model developed and validated in several observational datasets recommended more frequent and earlier use of vasopressin than average care patterns and was associated with reduced mortality.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Saria reported holding a leadership position at Bayesian Health; receiving honoraria for lectures on artificial intelligence and health from various tech, biotech, healthtech, and professional societies; and holding equity in several companies, including Bayesian Health, Century Health, Specta Health, Midstream Health, Latent Health, and Duality Tech. Dr Seymour reported receiving grants from National Institutes of Health (NIH)/National Institute of General Medical Sciences; personal fees from Beckman Coulter, Octapharma, and Deepull outside the submitted work; and travel funding for research conferences from the International Sepsis Forum and the International Symposium on Intensive Care and Emergency Medicine. Dr Pirracchio reported receiving grants from AOP Health during the conduct of the study; and research grants from NIH, Patient-Centered Outcomes Research Institute, and US Department of Defense. No other disclosures were reported.

Figures

**Figure 1.. Patient Accrual in a Study of a Reinforcement Learning Rule for Vasopressin Initiation in Septic Shock**
eICU-CRD indicates eICU Collaborative Research Database; MIMIC-IV, Medical Information Mart for Intensive Care; UCSF, University of California, San Francisco; UPMC, University of Pittsburgh Medical Center.

**Figure 2.. Comparison of Clinician-Observed Administration of Vasopressin With Treatment Recommended by the Reinforcement Learning Rule**
A and B, 100 randomly selected patients included in each panel. Each line represents 1 patient trajectory. Red indicates the patient received norepinephrine alone, with a color scale representing the norepinephrine dose, and purple indicates both norepinephrine and vasopressin were infused. White corresponds to discharge alive. Black boxes at the end of the trajectory represent mortality. C, Number of patients in whom vasopressin was initiated in each time block for the clinical observed actions. D, Number of patients in whom vasopressin was initiated per the reinforcement learning rule.

**Figure 3.. Weighted Importance Sampling**
Weighted importance sampling measures the mean individual reward obtained using the reinforcement learning rule and the mean reward associated with the clinician-observed actions. Weighted importance sampling was estimated in the internal and external validation sets (overall reward) for each reward component independently (reward component) and each internal and external validation set separately (internal/external validation). Results are presented as the difference in weighted importance sampling between the reinforcement learning rule and the clinician’s observed rule, with bootstrapped 95% CIs. A negative weighted importance sampling difference indicated that the clinician-observed actions were associated with a higher reward, whereas a positive difference suggested the reinforcement learning rule yielded a higher reward. For example, the reinforcement learning rule was associated with a higher overall reward (weighted importance sampling difference, 31 [95% CI, 15-52]) as well as higher rewards for each component individually. In the UCSF internal validation set, the lower bound of the 95% CI crossed 0 (weighted importance sampling difference, 15 [95% CI, −48 to 129]), indicating that the overall reward obtained with the reinforcement learning rule was not statistically higher than that associated with the clinician-observed actions. The dotted line is the reference line (ie, no difference in weighted importance sampling between the algorithm rule and the clinician-observed actions). eICU-CRD indicates eICU Collaborative Research Database; MIMIC-IV, Medical Information Mart for Intensive Care; SOFA, Sequential Organ Failure Assessment; UCSF, University of California, San Francisco; UPMC, University of Pittsburgh Medical Center.

**Figure 4.. Risk-Adjusted Odds of In-Hospital Mortality Comparing Concordance With the Reinforcement Learning Rule or a Simple Clinical Rule With Clinician-Observed Actions**
Distribution using a regular standard error estimator or a robust standard error estimator. Reinforcement learning rule results displayed for combined validation cohort as well as each individual cohort. Simple clinical decision rule results displayed for combined validation cohort only. The risk-adjusted odds for in-hospital mortality were derived from inverse probability of treatment weighted pooled logistic regression models, adjusting for baseline and time-varying confounders. The results for vasopressin initiated per the reinforcement learning rule show the ORs for in-hospital mortality of concordance with the reinforcement learning rule in each 1-hour epoch compared with the clinician-observed actions for the overall external validation set and for each external validation dataset separately. The results for vasopressin initiated per the simple clinical rule show the ORs for in-hospital mortality of concordance with 3 independent simple clinical rules for vasopressin initiation in each 1-hour epoch for the overall external validation set. The 3 simple rules are: “initiate vasopressin when serum lactate is >4 mmol/L,” “initiate vasopressin when norepinephrine dose is >0.7 μg/kg/min,” and “initiate vasopressin when MAP is <65 mm Hg and time from shock onset is at least 12 hours.” eICU-CRD indicates eICU Collaborative Research Database; MAP, mean arterial pressure; MIMIC-IV, Medical Information Mart for Intensive Care; OR, odds ratio; UPMC, University of Pittsburgh Medical Center.

See this image and copyright information in PMC

References

1. Fleischmann-Struzek C, Mellhammar L, Rose N, et al. Incidence and mortality of hospital- and ICU-treated sepsis: results from an updated and expanded systematic review and meta-analysis. Intensive Care Med. 2020;46(8):1552-1562. doi: 10.1007/s00134-020-06151-x - DOI - PMC - PubMed
1. Singer M, Deutschman CS, Seymour CW, et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 2016;315(8):801-810. doi: 10.1001/jama.2016.0287 - DOI - PMC - PubMed
1. Evans L, Rhodes A, Alhazzani W, et al. Executive summary: Surviving Sepsis Campaign: international guidelines for the management of sepsis and septic shock 2021. Crit Care Med. 2021;49(11):1974-1982. doi: 10.1097/CCM.0000000000005357 - DOI - PubMed
1. Vail EA, Gershengorn HB, Hua M, Walkey AJ, Wunsch H. Epidemiology of vasopressin use for adults with septic shock. Ann Am Thorac Soc. 2016;13(10):1760-1767. doi: 10.1513/AnnalsATS.201604-259OC - DOI - PMC - PubMed
1. Sutton RS, Barto AG. The reinforcement learning problem. 1998. Accessed July 3, 2023. http://incompleteideas.net/book/first/Chap3PrePub.pdf

Publication types

Actions
Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions

Grants and funding

R35 GM119519/GM/NIGMS NIH HHS/United States

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Optimal Vasopressin Initiation in Septic Shock: The OVISS Reinforcement Learning Study

Affiliations

Optimal Vasopressin Initiation in Septic Shock: The OVISS Reinforcement Learning Study

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous