Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Feb 22:2021:6657119.
doi: 10.1155/2021/6657119. eCollection 2021.

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Affiliations
Review

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Ying Liu et al. Comput Math Methods Med. .

Abstract

Dynamic decision-making was essential in the clinical care of surgical patients. Reinforcement learning (RL) algorithm is a computational method to find sequential optimal decisions among multiple suboptimal options. This review is aimed at introducing RL's basic concepts, including three basic components: the state, the action, and the reward. Most medical studies using reinforcement learning methods were trained on a fixed observational dataset. This paper also reviews the literature of existing practical applications using reinforcement learning methods, which can be further categorized as a statistical RL study and a computational RL study. The review proposes several potential aspects where reinforcement learning can be applied in neurocritical and neurosurgical care. These include sequential treatment strategies of intracranial tumors and traumatic brain injury and intraoperative endoscope motion control. Several limitations of reinforcement learning are representations of basic components, the positivity violation, and validation methods.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflicts of interest.

Figures

Figure 1
Figure 1
(a) A patient with traumatic brain injury and intracranial hypertension; sequential treatment includes concentrated sodium, mechanical ventilation, sedation, and possible outcomes. (b) The trajectories (strategies) of three patients and their expected total reward from all treatments performed.
Figure 2
Figure 2
Uniform conceptions in reinforcement learning: the state, the action, and the reward. Physicians gave treatment (action, A) to the patient (state, S) with some vital signs, lab tests, and physical examinations at a specific time point. The patient responds to the treatment (reward, R).
Figure 3
Figure 3
Illustration of a proposed reinforcement learning framework to find optimal dynamic treatment therapy in patients with traumatic brain injury. P represents the probability of the outcome after treatment at each stage; r represents the reward after treatment at each stage.

Similar articles

Cited by

References

    1. Zhang Z., written on behalf of AME Big-Data Clinical Trial Collaborative Group Reinforcement learning in clinical medicine: a method to optimize dynamic treatment regime over time. Annals of Translational Medicine. 2019;7(14):p. 345. doi: 10.21037/atm.2019.06.75. - DOI - PMC - PubMed
    1. Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction. Cambridge, MA, USA: MIT Press; 2018.
    1. Lavori P. W., Dawson R. Adaptive treatment strategies in chronic disease. Annual Review of Medicine. 2008;59(1):443–453. doi: 10.1146/annurev.med.59.062606.122232. - DOI - PMC - PubMed
    1. Stroup T. S., McEvoy J. P., Swartz M. S., et al. The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: schizophrenia trial design and protocol development. Schizophrenia Bulletin. 2003;29(1):15–31. doi: 10.1093/oxfordjournals.schbul.a006986. - DOI - PubMed
    1. Gaynes B. N., Warden D., Trivedi M. H., Wisniewski S. R., Fava M., Rush A. J. What did STAR∗D teach us? Results from a large-scale, practical, clinical trial for patients with depression. Psychiatric Services. 2009;60(11):1439–1445. doi: 10.1176/ps.2009.60.11.1439. - DOI - PubMed

LinkOut - more resources