Credit assignment in multiple goal embodied visuomotor behavior
- PMID: 21833235
- PMCID: PMC3153784
- DOI: 10.3389/fpsyg.2010.00173
Abstract
The intrinsic complexity of the brain can lead one to set aside issues related to its relationship with the body, but the field of embodied cognition emphasizes that understanding brain function at the system level requires addressing the role of the brain-body interface. It has only recently been appreciated that this interface performs a huge amount of computation that does not have to be repeated by the brain, and thus affords the brain great simplifications in its representations. In effect, the brain's abstract states can refer to coded representations of the world created by the body. But even if the brain can communicate with the world through abstractions, the severe speed limitations of its neural circuitry mean that vast amounts of indexing must be performed during development so that appropriate behavioral responses can be rapidly accessed. One way this could happen is if the brain used a decomposition whereby behavioral primitives could be quickly accessed and combined. This realization motivates our study of independent sensorimotor task solvers, which we call modules, in directing behavior. The issue we focus on here is how an embodied agent can learn to calibrate such individual visuomotor modules while pursuing multiple goals. The biologically plausible standard for module programming is reinforcement given during exploration of the environment. However, this formulation raises a substantial issue when sensorimotor modules are used in combination: the credit for their overall performance must be divided among them. We show that this problem can be solved and that diverse task combinations benefit learning rather than complicating it, as is usually assumed. Our simulations show that fast algorithms are available that allot credit correctly and are insensitive to measurement noise.
Keywords: credit assignment; learning; modules; reinforcement; reward.
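The credit assignment problem described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the article: the function name, parameters, and update rule are assumptions chosen for illustration. Each module keeps a running estimate of its own reward, observes only the noisy total reward of the currently active modules, and attributes to itself whatever part of that total the other active modules do not already claim. When the set of active modules varies from step to step, the individual estimates should approach the true values; when the same combination is always active, only their sum is identifiable.

```python
import random


def estimate_module_rewards(true_rewards, n_steps=5000, beta=0.1,
                            noise_sd=0.1, vary_combinations=True, seed=0):
    """Estimate per-module rewards from a shared, noisy total reward.

    Hypothetical illustration only: the update rule is an assumed simple
    form, not necessarily the algorithm used in the article.
    """
    rng = random.Random(seed)
    n = len(true_rewards)
    estimates = [0.0] * n  # each module's current guess of its own reward

    for _ in range(n_steps):
        if vary_combinations:
            # Random subset of modules is active on this step.
            active = [i for i in range(n) if rng.random() < 0.5]
            if not active:
                continue
        else:
            # The same combination (all modules) is active on every step.
            active = list(range(n))

        # Only the summed reward is observed, corrupted by measurement noise.
        total = sum(true_rewards[i] for i in active) + rng.gauss(0.0, noise_sd)

        # Each active module claims the part of the total that the other
        # active modules' current estimates do not account for.
        old = list(estimates)
        claimed = sum(old[i] for i in active)
        for i in active:
            target = total - (claimed - old[i])
            estimates[i] = (1.0 - beta) * old[i] + beta * target

    return estimates


if __name__ == "__main__":
    true_rewards = [1.0, -0.5, 2.0]  # assumed values for the demonstration
    print("varied combinations:", estimate_module_rewards(true_rewards))
    print("fixed combination:  ",
          estimate_module_rewards(true_rewards, vary_combinations=False))
```

In this toy setting the step size `beta` must be small relative to the number of simultaneously active modules, because every active module applies the same correction to the shared error.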