Reward-dependent learning in neuronal networks for planning and decision making

doi:10.1016/S0079-6123(00)26016-0

Review

. 2000:126:217-29.

doi: 10.1016/S0079-6123(00)26016-0.

Reward-dependent learning in neuronal networks for planning and decision making

S Dehaene¹, J P Changeux

Affiliations

PMID: 11105649
DOI: 10.1016/S0079-6123(00)26016-0

Review

Reward-dependent learning in neuronal networks for planning and decision making

S Dehaene et al. Prog Brain Res. 2000.

. 2000:126:217-29.

doi: 10.1016/S0079-6123(00)26016-0.

Authors

S Dehaene¹, J P Changeux

Affiliation

¹ INSERM U. 334, Service Hospitalier Frédéric Joliot, CEA/DSV, Orsay, France. dehaene@shfj.cea.fr

PMID: 11105649
DOI: 10.1016/S0079-6123(00)26016-0

Abstract

Neuronal network models have been proposed for the organization of evaluation and decision processes in prefrontal circuitry and their putative neuronal and molecular bases. The models all include an implementation and simulation of an elementary reward mechanism. Their central hypothesis is that tentative rules of behavior, which are coded by clusters of active neurons in prefrontal cortex, are selected or rejected based on an evaluation by this reward signal, which may be conveyed, for instance, by the mesencephalic dopaminergic neurons with which the prefrontal cortex is densely interconnected. At the molecular level, the reward signal is postulated to be a neurotransmitter such as dopamine, which exerts a global modulatory action on prefrontal synaptic efficacies, either via volume transmission or via targeted synaptic triads. Negative reinforcement has the effect of destabilizing the currently active rule-coding clusters; subsequently, spontaneous activity varies again from one cluster to another, giving the organism the chance to discover and learn a new rule. Thus, reward signals function as effective selection signals that either maintain or suppress currently active prefrontal representations as a function of their current adequacy. Simulations of this variation-selection have successfully accounted for the main features of several major tasks that depend on prefrontal cortex integrity, such as the delayed-response test, the Wisconsin card sorting test, the Tower of London test and the Stroop test. For the more complex tasks, we have found it necessary to supplement the external reward input with a second mechanism that supplies an internal reward; it consists of an auto-evaluation loop which short-circuits the reward input from the exterior. This allows for an internal evaluation of covert motor intentions without actualizing them as behaviors, by simply testing them covertly by comparison with memorized former experiences. This element of architecture gives access to enhanced rates of learning via an elementary process of internal or covert mental simulation. We have recently applied these ideas to a new model, developed with M. Kerszberg, which hypothesizes that prefrontal cortex and its reward-related connections contribute crucially to conscious effortful tasks. This model distinguishes two main computational spaces within the human brain: a unique global workspace composed of distributed and heavily interconnected neurons with long-range axons, and a set of specialized and modular perceptual, motor, memory, evaluative and attentional processors. We postulate that workspace neurons are mobilized in effortful tasks for which the specialized processors do not suffice; they selectively mobilize or suppress, through descending connections, the contribution of specific processor neurons. In the course of task performance, workspace neurons become spontaneously co-activated, forming discrete though variable spatio-temporal patterns subject to modulation by vigilance signals and to selection by reward signals. A computer simulation of the Stroop task shows workspace activation to increase during acquisition of a novel task, effortful execution, and after errors. This model makes predictions concerning the spatio-temporal activation patterns during brain imaging of cognitive tasks, particularly concerning the conditions of activation of dorsolateral prefrontal cortex and anterior cingulate, their relation to reward mechanisms, and their specific reaction during error processing.

PubMed Disclaimer

Cited by

How does reward expectation influence cognition in the human brain?
Rowe JB, Eckstein D, Braver T, Owen AM. Rowe JB, et al. J Cogn Neurosci. 2008 Nov;20(11):1980-92. doi: 10.1162/jocn.2008.20140. J Cogn Neurosci. 2008. PMID: 18416677 Free PMC article.
The nature of blindsight: implications for current theories of consciousness.
Derrien D, Garric C, Sergent C, Chokron S. Derrien D, et al. Neurosci Conscious. 2022 Feb 28;2022(1):niab043. doi: 10.1093/nc/niab043. eCollection 2022. Neurosci Conscious. 2022. PMID: 35237447 Free PMC article. Review.
White matter alterations in anorexia nervosa: A systematic review of diffusion tensor imaging studies.
Martin Monzon B, Hay P, Foroughi N, Touyz S. Martin Monzon B, et al. World J Psychiatry. 2016 Mar 22;6(1):177-86. doi: 10.5498/wjp.v6.i1.177. eCollection 2016 Mar 22. World J Psychiatry. 2016. PMID: 27014606 Free PMC article.
Advances from neuroimaging studies in eating disorders.
Frank GK. Frank GK. CNS Spectr. 2015 Aug;20(4):391-400. doi: 10.1017/S1092852915000012. Epub 2015 Apr 23. CNS Spectr. 2015. PMID: 25902917 Free PMC article. Review.
The role of attention in conscious recollection.
De Brigard F. De Brigard F. Front Psychol. 2012 Feb 10;3:29. doi: 10.3389/fpsyg.2012.00029. eCollection 2012. Front Psychol. 2012. PMID: 22363305 Free PMC article.

See all "Cited by" articles

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Reward-dependent learning in neuronal networks for planning and decision making

Affiliation

Reward-dependent learning in neuronal networks for planning and decision making

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources