Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Jun;28(2):298-333.
doi: 10.1007/s10729-025-09699-6. Epub 2025 Apr 9.

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Affiliations
Review

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Qihao Wu et al. Health Care Manag Sci. 2025 Jun.

Abstract

With the advancement in computing power and data science techniques, reinforcement learning (RL) has emerged as a powerful tool for decision-making problems in complex systems. In recent years, the research on RL for healthcare operations has grown rapidly. Especially during the COVID-19 pandemic, RL has played a critical role in optimizing decisions with greater degrees of uncertainty. RL for healthcare applications has been an exciting topic across multiple disciplines, including operations research, operations management, healthcare systems engineering, and data science. This review paper first provides a tutorial on the overall framework of RL, including its key components, training models, and approximators. Then, we present the recent advances of RL in the domain of healthcare operations management (HOM) and analyze the current trends. Our paper concludes by presenting existing challenges and future directions for RL in HOM.

Keywords: Approximate dynamic programming; Healthcare operations; Healthcare services delivery; Markov decision process; Neural networks; Reinforcement learning.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethical Approval: None required. Conflict of interest: The authors report that there is no Conflict of interest to declare.

Figures

Fig. 1
Fig. 1
Number of publications related to RL for HOM by year
Fig. 2
Fig. 2
Mapping from applications to learning methods
Fig. 3
Fig. 3
Mapping from applications to approximators
Fig. 4
Fig. 4
Evolution of RL methodologies used in HOM

Similar articles

Cited by

References

    1. McLaughlin DB (2008) Healthcare operations management. AUPHA
    1. Bellman RE (2010) Dynamic programming. Princeton University Press
    1. Masmoudi M, Jarboui B, Siarry P (2021) Artificial intelligence and data mining in healthcare. Springer
    1. Yu C, Liu J, Nemati S et al (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36
    1. Liu S, See KC, Ngiam KY et al (2020) Reinforcement learning for clinical decision support in critical care: comprehensive review. J Med Int Res 22(7):e18,477 - PMC - PubMed

LinkOut - more resources