Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing
- PMID: 33839955
- DOI: 10.1007/s00429-021-02270-3
Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing
Abstract
Reward prediction error, the difference between the expected and obtained reward, is known to act as a reinforcement learning neural signal. In the current study, we propose a model fitting approach that combines behavioral and neural data to fit computational models of reinforcement learning. Briefly, we penalized subject-specific fitted parameters that moved away too far from the group median, except when that deviation led to an improvement in the model's fit to neural responses. By means of a probabilistic monetary learning task and fMRI, we compared our approach with standard model fitting methods. Q-learning outperformed actor-critic at both behavioral and neural level, although the inclusion of neuroimaging data into model fitting improved the fit of actor-critic models. We observed both action-value and state-value prediction error signals in the striatum, while standard model fitting approaches failed to capture state-value signals. Finally, left ventral striatum correlated with reward prediction error while right ventral striatum with fictive prediction error, suggesting a functional hemispheric asymmetry regarding prediction-error driven learning.
Keywords: Counterfactual; Fictive prediction error; Model fitting; Reward prediction error; fMRI.
References
-
- Abe H, Lee D (2011) Distributed coding of actual and hypothetical outcomes in the orbital and dorsolateral prefrontal cortex. Neuron 70:731–741. https://doi.org/10.1016/j.neuron.2011.03.026 - DOI - PubMed - PMC
-
- Aberg KC, Doell KC, Schwartz S (2015) Hemispheric asymmetries in striatal reward responses relate to approach-avoidance learning and encoding of positive-negative prediction errors in dopaminergic midbrain regions. J Neurosci 35:14491–14500. https://doi.org/10.1523/jneurosci.1859-15.2015 - DOI - PubMed - PMC
-
- Barto AG (1995) Adaptive Critics and the Basal Ganglia. In: Houk JC, Davis J, Beiser D (eds) Models of Information Processing in the Basal Ganglia. MIT Press, Cambridge, MA, pp 215–232
-
- Bartra O, Mcguire JT, Kable JW (2013) NeuroImage The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage 76:412–427. https://doi.org/10.1016/j.neuroimage.2013.02.063 - DOI - PubMed - PMC
-
- Bornstein AM, Daw ND (2012) Dissociating hippocampal and striatal contributions to sequential prediction learning. Euro J Neurosci 35:1011–1023. https://doi.org/10.1111/j.1460-9568.2011.07920.x - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
