Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces

Xiang Shen, Xiang Zhang, Yifan Huang, Shuhang Chen, Yiwen Wang

PMID: 33232240
DOI: 10.1109/TNSRE.2020.3039970

Xiang Shen et al. IEEE Trans Neural Syst Rehabil Eng. 2020 Dec;28(12):3089-3099. doi: 10.1109/TNSRE.2020.3039970. Epub 2021 Jan 28.


Erratum in

Corrections to "Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces".
Shen X, Zhang X, Huang Y, Chen S, Wang Y. IEEE Trans Neural Syst Rehabil Eng. 2021;29:2776. doi: 10.1109/TNSRE.2021.3080405. PMID: 35108199

Abstract

Autonomous brain machine interfaces (BMIs) aim to enable paralyzed people to self-evaluate their movement intention to control external devices. Previous reinforcement learning (RL)-based decoders interpret the mapping between neural activity and movements using an external reward for well-trained subjects, and have not investigated the task learning procedure. The brain has developed a learning mechanism to identify the correct actions that lead to rewards in a new task. This internal guidance can be used to replace the external reference and advance BMIs toward autonomous systems. In this study, we propose an internally rewarded RL-based BMI framework that uses multi-site recordings to demonstrate the autonomous learning ability of the BMI decoder on a new task. We test the model on neural data collected over multiple days while rats were learning a new lever discrimination task. The proposed RL framework decodes spikes from the primary motor cortex (M1) and medial prefrontal cortex (mPFC) into discrete lever-press actions. The mPFC activity after the action period is interpreted as internal reward information: a support vector machine classifies reward vs. non-reward trials with a high accuracy of 87.5% across subjects. This internal reward replaces the external water reward in updating the decoder, which is able to adapt to the nonstationary neural activity during subject learning. The multi-cortical recording allows the decoder to take in more cortical activity as input and to use internal critics to guide its learning. Compared with the classic decoder that uses M1 activity as the only input and external guidance, the proposed system with multi-cortical recordings shows better decoding accuracy.
More importantly, our internally rewarded decoder demonstrates autonomous learning ability on the new task: it successfully addresses the time-variant neural patterns while subjects are learning, and works asymptotically as the subjects' behavioral learning progresses. These results reveal the potential of endowing BMIs with autonomous task learning ability in the RL framework.
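
The loop described in the abstract (a classifier reads a reward signal from post-action activity, and that signal, rather than an external reward, updates the decoder) can be illustrated with a toy simulation. The sketch below is not the authors' implementation: the abstract does not specify the SVM kernel, the RL update rule, or the feature extraction, so the linear SVM trained by hinge-loss subgradient descent, the reward-modulated linear decoder, the epsilon-greedy exploration, and the synthetic 2-D Gaussian "M1" and "mPFC" features are all assumptions made for illustration. All function and variable names are hypothetical.

```python
import random

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def sample(rng, mean, sd=0.5):
    """Draw one synthetic 2-D feature vector around `mean` (toy data, not real spikes)."""
    return [rng.gauss(m, sd) for m in mean]

def train_linear_svm(X, y, epochs=20, lr=0.1, lam=0.01, seed=0):
    """Linear SVM fitted by subgradient descent on the L2-regularized hinge loss.
    Labels y must be +1 (rewarded) or -1 (not rewarded)."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = y[i] * (dot(w, X[i]) + b)
            w = [wj * (1.0 - lr * lam) for wj in w]  # L2 shrinkage step
            if margin < 1.0:  # hinge loss is active for this sample
                w = [wj + lr * y[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * y[i]
    return w, b

def internal_reward(svm, mpfc_features):
    """Critic: map post-action 'mPFC' features to a reward of +1 or -1."""
    w, b = svm
    return 1 if dot(w, mpfc_features) + b >= 0 else -1

# Assumed class means for the synthetic features (hypothetical values).
MPFC_MEAN = {True: (1.0, 1.0), False: (-1.0, -1.0)}  # rewarded vs. unrewarded trial
M1_MEAN = {0: (1.0, -1.0), 1: (-1.0, 1.0)}           # intended lever 0 vs. lever 1

def run_session(n_trials=1000, lr=0.05, epsilon=0.1, seed=1):
    rng = random.Random(seed)
    # 1) Calibrate the critic on trials whose outcome is known.
    outcomes = [rng.random() < 0.5 for _ in range(200)]
    X = [sample(rng, MPFC_MEAN[o]) for o in outcomes]
    y = [1 if o else -1 for o in outcomes]
    svm = train_linear_svm(X, y)
    critic_train_acc = sum(internal_reward(svm, x) == t for x, t in zip(X, y)) / len(y)

    # 2) Train the decoder using only the critic's output as reward.
    weights = [[0.0, 0.0], [0.0, 0.0]]  # one linear score per lever
    correct_flags = []
    for _ in range(n_trials):
        target = rng.randrange(2)
        m1 = sample(rng, M1_MEAN[target])
        if rng.random() < epsilon:  # occasional random exploration
            action = rng.randrange(2)
        else:
            action = max((0, 1), key=lambda a: dot(weights[a], m1))
        correct = action == target
        # The true outcome is hidden from the decoder; it only shapes the toy 'mPFC' response.
        r = internal_reward(svm, sample(rng, MPFC_MEAN[correct]))
        # Reward-modulated update: strengthen the chosen action if r=+1, weaken it if r=-1.
        weights[action] = [wj + lr * r * xj for wj, xj in zip(weights[action], m1)]
        correct_flags.append(correct)
    return critic_train_acc, correct_flags

if __name__ == "__main__":
    acc, flags = run_session()
    print("critic training accuracy:", acc)
    print("decoder accuracy, last 200 trials:", sum(flags[-200:]) / 200)
```

In this toy setting the decoder never sees whether the lever press was actually correct; it learns only from the critic's +1 or -1 output, which is the sense in which the reward is internal. The reported decoder accuracy includes the exploratory trials, so it stays below 100% even when the learned mapping is good.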

Cited by

Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.
Girdler B, Caldbeck W, Bae J. Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022. PMID: 36090185. Free PMC article. Review.


