A neural signature of hierarchical reinforcement learning
- PMID: 21791294
- PMCID: PMC3145918
- DOI: 10.1016/j.neuron.2011.05.042
Abstract
Human behavior displays hierarchical structure: simple actions cohere into subtask sequences, which work together to accomplish overall task goals. Although the neural substrates of such hierarchy have been the target of increasing research, they remain poorly understood. We propose that the computations supporting hierarchical behavior may relate to those in hierarchical reinforcement learning (HRL), a machine-learning framework that extends reinforcement-learning mechanisms into hierarchical domains. To test this, we leveraged a distinctive prediction arising from HRL. In ordinary reinforcement learning, reward prediction errors are computed when there is an unanticipated change in the prospects for accomplishing overall task goals. HRL entails that prediction errors should also occur in relation to task subgoals. In three neuroimaging studies we observed neural responses consistent with such subgoal-related reward prediction errors, within structures previously implicated in reinforcement learning. The results reported support the relevance of HRL to the neural processes underlying hierarchical behavior.
Copyright © 2011 Elsevier Inc. All rights reserved.
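The distinction the abstract draws between task-level and subgoal-level prediction errors can be illustrated with a toy calculation. The sketch below is only an assumption-laden illustration, not the authors' model or task: it uses the standard temporal-difference form of the prediction error, a hypothetical value function equal to the negative of the remaining distance, and a hypothetical event in which the subgoal becomes closer while the total path to the overall goal is unchanged. All names and numbers are invented for the example.

```python
def prediction_error(reward, value_next, value_current, gamma=1.0):
    """Temporal-difference prediction error: r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_current


def value_from_distance(distance):
    """Assumed toy value function: fewer remaining steps means higher value."""
    return -float(distance)


# Hypothetical event: the subgoal moves closer (5 -> 3 steps), but the
# total path length to the overall goal stays at 10 steps.
before = {"to_subgoal": 5, "total_to_goal": 10}
after = {"to_subgoal": 3, "total_to_goal": 10}

# Task level: prospects for the overall goal are unchanged, so no error.
task_pe = prediction_error(
    0.0,
    value_from_distance(after["total_to_goal"]),
    value_from_distance(before["total_to_goal"]),
)

# Subgoal level: prospects for reaching the subgoal improved, so the
# error is positive. No pseudo-reward is delivered yet (assumed 0.0),
# because the subgoal has not been attained.
subgoal_pe = prediction_error(
    0.0,
    value_from_distance(after["to_subgoal"]),
    value_from_distance(before["to_subgoal"]),
)

print("task-level prediction error:", task_pe)
print("subgoal-level prediction error:", subgoal_pe)
```

Under these assumptions the task-level error is zero while the subgoal-level error is not, which is the kind of dissociation that ordinary reinforcement learning does not predict and HRL does.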
Comment in
- How to perfect a chocolate soufflé and other important problems. Neuron. 2011 Jul 28;71(2):203-5. doi: 10.1016/j.neuron.2011.07.004. PMID: 21791280