Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

doi:10.1007/s10729-025-09699-6

Review

. 2025 Jun;28(2):298-333.

doi: 10.1007/s10729-025-09699-6. Epub 2025 Apr 9.

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Qihao Wu¹, Jiangxue Han¹, Yimo Yan¹, Yong-Hong Kuo², Zuo-Jun Max Shen^{3

4}

Affiliations

¹ Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China.
² Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China. yhkuo@hku.hk.
³ Faculty of Engineering and Business School, The University of Hong Kong, Hong Kong, China.
⁴ Department of Industrial Engineering & Operations Research, University of California, Berkeley, Berkeley, California, USA.

PMID: 40202690
PMCID: PMC12137509
DOI: 10.1007/s10729-025-09699-6

Review

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Qihao Wu et al. Health Care Manag Sci. 2025 Jun.

. 2025 Jun;28(2):298-333.

doi: 10.1007/s10729-025-09699-6. Epub 2025 Apr 9.

Authors

Qihao Wu¹, Jiangxue Han¹, Yimo Yan¹, Yong-Hong Kuo², Zuo-Jun Max Shen^{3

4}

Affiliations

¹ Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China.
² Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China. yhkuo@hku.hk.
³ Faculty of Engineering and Business School, The University of Hong Kong, Hong Kong, China.
⁴ Department of Industrial Engineering & Operations Research, University of California, Berkeley, Berkeley, California, USA.

PMID: 40202690
PMCID: PMC12137509
DOI: 10.1007/s10729-025-09699-6

Abstract

With the advancement in computing power and data science techniques, reinforcement learning (RL) has emerged as a powerful tool for decision-making problems in complex systems. In recent years, the research on RL for healthcare operations has grown rapidly. Especially during the COVID-19 pandemic, RL has played a critical role in optimizing decisions with greater degrees of uncertainty. RL for healthcare applications has been an exciting topic across multiple disciplines, including operations research, operations management, healthcare systems engineering, and data science. This review paper first provides a tutorial on the overall framework of RL, including its key components, training models, and approximators. Then, we present the recent advances of RL in the domain of healthcare operations management (HOM) and analyze the current trends. Our paper concludes by presenting existing challenges and future directions for RL in HOM.

Keywords: Approximate dynamic programming; Healthcare operations; Healthcare services delivery; Markov decision process; Neural networks; Reinforcement learning.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethical Approval: None required. Conflict of interest: The authors report that there is no Conflict of interest to declare.

Figures

**Fig. 1**
Number of publications related to RL for HOM by year

**Fig. 2**
Mapping from applications to learning methods

**Fig. 3**
Mapping from applications to approximators

**Fig. 4**
Evolution of RL methodologies used in HOM

See this image and copyright information in PMC

Cited by

e-Health Strategy for Surgical Prioritization: A Methodology Based on Digital Twins and Reinforcement Learning.
Silva-Aravena F, Morales J, Jayabalan M. Silva-Aravena F, et al. Bioengineering (Basel). 2025 Jun 2;12(6):605. doi: 10.3390/bioengineering12060605. Bioengineering (Basel). 2025. PMID: 40564421 Free PMC article.
A hybrid reinforcement learning and knowledge graph framework for financial risk optimization in healthcare systems.
Uddin MS, Ahmed A, Aktarujjaman M, Moniruzzaman M, Ahmed M, Mridha MF, Hossen MJ. Uddin MS, et al. Sci Rep. 2025 Aug 8;15(1):29057. doi: 10.1038/s41598-025-14355-8. Sci Rep. 2025. PMID: 40781534 Free PMC article.

References

1. McLaughlin DB (2008) Healthcare operations management. AUPHA
1. Bellman RE (2010) Dynamic programming. Princeton University Press
1. Masmoudi M, Jarboui B, Siarry P (2021) Artificial intelligence and data mining in healthcare. Springer
1. Yu C, Liu J, Nemati S et al (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36
1. Liu S, See KC, Ngiam KY et al (2020) Reinforcement learning for clinical decision support in critical care: comprehensive review. J Med Int Res 22(7):e18,477 - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- PubMed Central
- Springer
Medical
- MedlinePlus Health Information

[1] McLaughlin DB (2008) Healthcare operations management. AUPHA

[2] McLaughlin DB (2008) Healthcare operations management. AUPHA

[3] Bellman RE (2010) Dynamic programming. Princeton University Press

[4] Bellman RE (2010) Dynamic programming. Princeton University Press

[5] Masmoudi M, Jarboui B, Siarry P (2021) Artificial intelligence and data mining in healthcare. Springer

[6] Masmoudi M, Jarboui B, Siarry P (2021) Artificial intelligence and data mining in healthcare. Springer

[7] Yu C, Liu J, Nemati S et al (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36

[8] Yu C, Liu J, Nemati S et al (2021) Reinforcement learning in healthcare: a survey. ACM Comput Surv (CSUR) 55(1):1–36

[9] Liu S, See KC, Ngiam KY et al (2020) Reinforcement learning for clinical decision support in critical care: comprehensive review. J Med Int Res 22(7):e18,477 - PMC - PubMed

[10] Liu S, See KC, Ngiam KY et al (2020) Reinforcement learning for clinical decision support in critical care: comprehensive review. J Med Int Res 22(7):e18,477 - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Affiliations

Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical