Axiomatic methods, dopamine and reward prediction error
- PMID: 18678251
- DOI: 10.1016/j.conb.2008.07.007
Axiomatic methods, dopamine and reward prediction error
Abstract
The phasic firing rate of midbrain dopamine neurons has been shown to respond both to the receipt of rewarding stimuli, and the degree to which such stimuli are anticipated by the recipient. This has led to the hypothesis that these neurons encode reward prediction error (RPE)-the difference between how rewarding an event is, and how rewarding it was expected to be. However, the RPE model is one of a number of competing explanations for dopamine activity that have proved hard to disentangle, mainly because they are couched in terms of latent, or unobservable, variables. This article describes techniques for dealing with latent variables common in economics and decision theory, and reviews work that uses these techniques to provide simple, non-parametric tests of the RPE hypothesis, allowing clear differentiation between competing explanations.
Similar articles
-
Midbrain dopamine neurons encode a quantitative reward prediction error signal.Neuron. 2005 Jul 7;47(1):129-41. doi: 10.1016/j.neuron.2005.05.020. Neuron. 2005. PMID: 15996553 Free PMC article.
-
Reward prediction error computation in the pedunculopontine tegmental nucleus neurons.Ann N Y Acad Sci. 2007 May;1104:310-23. doi: 10.1196/annals.1390.003. Epub 2007 Mar 7. Ann N Y Acad Sci. 2007. PMID: 17344541 Review.
-
Dopamine neurons can represent context-dependent prediction error.Neuron. 2004 Jan 22;41(2):269-80. doi: 10.1016/s0896-6273(03)00869-9. Neuron. 2004. PMID: 14741107
-
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9. Prog Brain Res. 2000. PMID: 11105648 Review.
-
Midbrain dopamine neurons encode decisions for future action.Nat Neurosci. 2006 Aug;9(8):1057-63. doi: 10.1038/nn1743. Epub 2006 Jul 23. Nat Neurosci. 2006. PMID: 16862149
Cited by
-
Frontal theta reflects uncertainty and unexpectedness during exploration and exploitation.Cereb Cortex. 2012 Nov;22(11):2575-86. doi: 10.1093/cercor/bhr332. Epub 2011 Nov 25. Cereb Cortex. 2012. PMID: 22120491 Free PMC article.
-
Frontal theta links prediction errors to behavioral adaptation in reinforcement learning.Neuroimage. 2010 Feb 15;49(4):3198-209. doi: 10.1016/j.neuroimage.2009.11.080. Epub 2009 Dec 5. Neuroimage. 2010. PMID: 19969093 Free PMC article.
-
Humans primarily use model-based inference in the two-stage task.Nat Hum Behav. 2020 Oct;4(10):1053-1066. doi: 10.1038/s41562-020-0905-y. Epub 2020 Jul 6. Nat Hum Behav. 2020. PMID: 32632333
-
Testing the reward prediction error hypothesis with an axiomatic model.J Neurosci. 2010 Oct 6;30(40):13525-36. doi: 10.1523/JNEUROSCI.1747-10.2010. J Neurosci. 2010. PMID: 20926678 Free PMC article.
-
Abnormal approach-related motivation but spared reinforcement learning in MDD: Evidence from fronto-midline Theta oscillations and frontal Alpha asymmetry.Cogn Affect Behav Neurosci. 2019 Jun;19(3):759-777. doi: 10.3758/s13415-019-00693-4. Cogn Affect Behav Neurosci. 2019. PMID: 30675690
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources