Review

. 2021 Feb 22:2021:6657119.

doi: 10.1155/2021/6657119. eCollection 2021.

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Ying Liu¹, Nidan Qiao^{2

3

4

5}, Yuksel Altinel⁵

Affiliations

¹ Lhorong People's Hospital, Tibet, China.
² Department of Neurosurgery, Huashan Hospital, Shanghai Medical School, Fudan University, Shanghai, China.
³ Shanghai Clinical Medical Center of Neurosurgery, Shanghai, China.
⁴ Neurosurgical Institute of Fudan University, Shanghai, China.
⁵ Medical Science in Clinical Investigation, Harvard Medical School, Boston, USA.

PMID: 33680069
PMCID: PMC7925047
DOI: 10.1155/2021/6657119

Review

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Ying Liu et al. Comput Math Methods Med. 2021.

. 2021 Feb 22:2021:6657119.

doi: 10.1155/2021/6657119. eCollection 2021.

Authors

Ying Liu¹, Nidan Qiao^{2

3

4

5}, Yuksel Altinel⁵

Affiliations

¹ Lhorong People's Hospital, Tibet, China.
² Department of Neurosurgery, Huashan Hospital, Shanghai Medical School, Fudan University, Shanghai, China.
³ Shanghai Clinical Medical Center of Neurosurgery, Shanghai, China.
⁴ Neurosurgical Institute of Fudan University, Shanghai, China.
⁵ Medical Science in Clinical Investigation, Harvard Medical School, Boston, USA.

PMID: 33680069
PMCID: PMC7925047
DOI: 10.1155/2021/6657119

Abstract

Dynamic decision-making was essential in the clinical care of surgical patients. Reinforcement learning (RL) algorithm is a computational method to find sequential optimal decisions among multiple suboptimal options. This review is aimed at introducing RL's basic concepts, including three basic components: the state, the action, and the reward. Most medical studies using reinforcement learning methods were trained on a fixed observational dataset. This paper also reviews the literature of existing practical applications using reinforcement learning methods, which can be further categorized as a statistical RL study and a computational RL study. The review proposes several potential aspects where reinforcement learning can be applied in neurocritical and neurosurgical care. These include sequential treatment strategies of intracranial tumors and traumatic brain injury and intraoperative endoscope motion control. Several limitations of reinforcement learning are representations of basic components, the positivity violation, and validation methods.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflicts of interest.

Figures

**Figure 1**
(a) A patient with traumatic brain injury and intracranial hypertension; sequential treatment includes concentrated sodium, mechanical ventilation, sedation, and possible outcomes. (b) The trajectories (strategies) of three patients and their expected total reward from all treatments performed.

**Figure 2**
Uniform conceptions in reinforcement learning: the state, the action, and the reward. Physicians gave treatment (action, A) to the patient (state, S) with some vital signs, lab tests, and physical examinations at a specific time point. The patient responds to the treatment (reward, R).

**Figure 3**
Illustration of a proposed reinforcement learning framework to find optimal dynamic treatment therapy in patients with traumatic brain injury. P represents the probability of the outcome after treatment at each stage; r represents the reward after treatment at each stage.

See this image and copyright information in PMC

References

1. Zhang Z., written on behalf of AME Big-Data Clinical Trial Collaborative Group Reinforcement learning in clinical medicine: a method to optimize dynamic treatment regime over time. Annals of Translational Medicine. 2019;7(14):p. 345. doi: 10.21037/atm.2019.06.75. - DOI - PMC - PubMed
1. Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction. Cambridge, MA, USA: MIT Press; 2018.
1. Lavori P. W., Dawson R. Adaptive treatment strategies in chronic disease. Annual Review of Medicine. 2008;59(1):443–453. doi: 10.1146/annurev.med.59.062606.122232. - DOI - PMC - PubMed
1. Stroup T. S., McEvoy J. P., Swartz M. S., et al. The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: schizophrenia trial design and protocol development. Schizophrenia Bulletin. 2003;29(1):15–31. doi: 10.1093/oxfordjournals.schbul.a006986. - DOI - PubMed
1. Gaynes B. N., Warden D., Trivedi M. H., Wisniewski S. R., Fava M., Rush A. J. What did STAR∗D teach us? Results from a large-scale, practical, clinical trial for patients with depression. Psychiatric Services. 2009;60(11):1439–1445. doi: 10.1176/ps.2009.60.11.1439. - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Affiliations

Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical