Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces
- PMID: 33232240
- DOI: 10.1109/TNSRE.2020.3039970
Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces
Erratum in
-
Corrections to "Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces".IEEE Trans Neural Syst Rehabil Eng. 2021;29:2776. doi: 10.1109/TNSRE.2021.3080405. IEEE Trans Neural Syst Rehabil Eng. 2021. PMID: 35108199
Abstract
Autonomous brain machine interfaces (BMIs) aim to enable paralyzed people to self-evaluate their movement intention to control external devices. Previous reinforcement learning (RL)-based decoders interpret the mapping between neural activity and movements using the external reward for well-trained subjects, and have not investigated the task learning procedure. The brain has developed a learning mechanism to identify the correct actions that lead to rewards in the new task. This internal guidance can be utilized to replace the external reference to advance BMIs as an autonomous system. In this study, we propose to build an internally rewarded reinforcement learning-based BMI framework using the multi-site recording to demonstrate the autonomous learning ability of the BMI decoder on the new task. We test the model on the neural data collected over multiple days while the rats were learning a new lever discrimination task. The primary motor cortex (M1) and medial prefrontal cortex (mPFC) spikes are interpreted by the proposed RL framework into the discrete lever press actions. The neural activity of the mPFC post the action duration is interpreted as the internal reward information, where a support vector machine is implemented to classify the reward vs. non-reward trials with a high accuracy of 87.5% across subjects. This internal reward is used to replace the external water reward to update the decoder, which is able to adapt to the nonstationary neural activity during subject learning. The multi-cortical recording allows us to take in more cortical recordings as input and uses internal critics to guide the decoder learning. Comparing with the classic decoder using M1 activity as the only input and external guidance, the proposed system with multi-cortical recordings shows a better decoding accuracy. More importantly, our internally rewarded decoder demonstrates the autonomous learning ability on the new task as the decoder successfully addresses the time-variant neural patterns while subjects are learning, and works asymptotically as the subjects' behavioral learning progresses. It reveals the potential of endowing BMIs with autonomous task learning ability in the RL framework.
Similar articles
-
Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022. Front Syst Neurosci. 2022. PMID: 36090185 Free PMC article. Review.
-
Estimating Reward Function from Medial Prefrontal Cortex Cortical Activity using Inverse Reinforcement Learning.Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3346-3349. doi: 10.1109/EMBC48229.2022.9871194. Annu Int Conf IEEE Eng Med Biol Soc. 2022. PMID: 36086257
-
Reinforcement Learning based Decoding Using Internal Reward for Time Delayed Task in Brain Machine Interfaces.Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:3351-3354. doi: 10.1109/EMBC44109.2020.9175964. Annu Int Conf IEEE Eng Med Biol Soc. 2020. PMID: 33018722
-
Intermediate Sensory Feedback Assisted Multi-Step Neural Decoding for Reinforcement Learning Based Brain-Machine Interfaces.IEEE Trans Neural Syst Rehabil Eng. 2022;30:2834-2844. doi: 10.1109/TNSRE.2022.3210700. Epub 2022 Oct 20. IEEE Trans Neural Syst Rehabil Eng. 2022. PMID: 36219654
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
Cited by
-
Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022. Front Syst Neurosci. 2022. PMID: 36090185 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources