Goal-directed learning of features and forward models

Sohrab Saeb¹, Cornelius Weber, Jochen Triesch

Affiliations

PMID: 19616917
DOI: 10.1016/j.neunet.2009.06.049

Goal-directed learning of features and forward models

Sohrab Saeb et al. Neural Netw. 2009 Jul-Aug.

. 2009 Jul-Aug;22(5-6):586-92.

doi: 10.1016/j.neunet.2009.06.049. Epub 2009 Jul 8.

Authors

Sohrab Saeb¹, Cornelius Weber, Jochen Triesch

Affiliation

¹ Frankfurt Institute for Advanced Studies, Goethe University, Frankfurt am Main, Germany. saeb@fias.uni-frankfurt.de

PMID: 19616917
DOI: 10.1016/j.neunet.2009.06.049

Abstract

The brain is able to perform actions based on an adequate internal representation of the world, where task-irrelevant features are ignored and incomplete sensory data are estimated. Traditionally, it is assumed that such abstract state representations are obtained purely from the statistics of sensory input for example by unsupervised learning methods. However, more recent findings suggest an influence of the dopaminergic system, which can be modeled by a reinforcement learning approach. Standard reinforcement learning algorithms act on a single layer network connecting the state space to the action space. Here, we involve in a feature detection stage and a memory layer, which together, construct the state space for a learning agent. The memory layer consists of the state activation at the previous time step as well as the previously chosen action. We present a temporal difference based learning rule for training the weights from these additional inputs to the state layer. As a result, the performance of the network is maintained both, in the presence of task-irrelevant features, and at randomly occurring time steps during which the input is invisible. Interestingly, a goal-directed forward model emerges from the memory weights, which only covers the state-action pairs that are relevant to the task. The model presents a link between reinforcement learning, feature detection and forward models and may help to explain how reward systems recruit cortical circuits for goal-directed feature detection and prediction.

PubMed Disclaimer

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Goal-directed learning of features and forward models

Affiliation

Goal-directed learning of features and forward models

Authors

Affiliation

Abstract

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials