Model-based predictions for dopamine

Angela J Langdon¹, Melissa J Sharpe², Geoffrey Schoenbaum³, Yael Niv⁴

Affiliations

¹ Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States. Electronic address: alangdon@princeton.edu.
² Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States; National Institute on Drug Abuse, Baltimore, MD 21224, United States; School of Psychology, University of New South Wales, Australia.
³ National Institute on Drug Abuse, Baltimore, MD 21224, United States.
⁴ Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States.

PMID: 29096115
PMCID: PMC6034703
DOI: 10.1016/j.conb.2017.10.006

Review

Model-based predictions for dopamine

Angela J Langdon et al. Curr Opin Neurobiol. 2018 Apr.

. 2018 Apr:49:1-7.

doi: 10.1016/j.conb.2017.10.006. Epub 2017 Oct 31.

Authors

Angela J Langdon¹, Melissa J Sharpe², Geoffrey Schoenbaum³, Yael Niv⁴

Affiliations

¹ Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States. Electronic address: alangdon@princeton.edu.
² Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States; National Institute on Drug Abuse, Baltimore, MD 21224, United States; School of Psychology, University of New South Wales, Australia.
³ National Institute on Drug Abuse, Baltimore, MD 21224, United States.
⁴ Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States.

PMID: 29096115
PMCID: PMC6034703
DOI: 10.1016/j.conb.2017.10.006

Abstract

Phasic dopamine responses are thought to encode a prediction-error signal consistent with model-free reinforcement learning theories. However, a number of recent findings highlight the influence of model-based computations on dopamine responses, and suggest that dopamine prediction errors reflect more dimensions of an expected outcome than scalar reward value. Here, we review a selection of these recent results and discuss the implications and complications of model-based predictions for computational theories of dopamine and learning.

PubMed Disclaimer

Figures

**Figure 1**
Multiple dimensions of prediction in dopamine prediction errors. Consider a simple task in which a brief presentation of a light cue is repeatedly followed by a drop of vanilla milk after some fixed delay (middle). What would happen on a trial in which the light is followed by a drop of equally-preferred chocolate milk after a shorter delay? Model-free TDRL with a complete serial compound stimulus representation proposes that the cue triggers a discrete sequence of activity that represents sequential time points after the presentation of the cue (left; a number of neurons are depicted horizontally; their activity at different timepoints is portrayed vertically). At each timepoint, summation of this weighted representation produces a scalar estimate of future value (V), which dopamine neurons (DA) compare to obtained reward to compute a prediction error signal. The prediction error is then broadcast widely (red) and used to modify the weights for neurons that were recently active (circles on arrows). When an unexpectedly early, chocolate-flavored reward is delivered, the prediction error signals the difference in time-discounted value, and modifies the weights for the part of the representation that is active when the prediction error is signaled. In contrast, we propose that dopamine neurons have access to (and maybe aid in learning) dimensions of prediction other than scalar value, and these are used for computation and signaling of prediction errors (right). For example, after the presentation of the cue, multiple features of the predicted next event (in this case, a liquid reward) may be represented by (perhaps overlapping) populations of neurons through time (color gradient), including the predicted amount (for example, one drop), the delay to reward delivery (it will arrive after several seconds) and the flavor of the reward (vanilla milk). At the time of reward delivery, violations of the prediction along any of these dimensions may elicit a phasic response from dopamine neurons, though different neurons may be specialized for prediction errors corresponding to different dimensions. In this case, at the early presentation of a drop of chocolate milk, prediction errors are elicited for the timing of reward delivery as well as for flavor (red) but no prediction error arises for amount (black).

See this image and copyright information in PMC

References

1. Montague PR, Dayan P, Sejnowski TJ. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience. 1996;16:1936–1947. - PMC - PubMed
1. Schultz W, Dayan P, Montague PR. A neural substrate of prediction and reward. Science. 1997;275:1593–1599. - PubMed
1. Roesch MR, Calu DJ, Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience. 2007;10:1615–1624. - PMC - PubMed
1. Eshel N, Bukwich M, Rao V, Hemmelder V, Tian J, Uchida N. Arithmetic and local circuitry underlying dopamine prediction errors. Nature. 2015;525:243–246. - PMC - PubMed
1. Niv Y, Schoenbaum G. Dialogues on prediction errors. Trends in Cognitive Sciences. 2008;12:265–272. - PubMed

Publication types

Actions
Actions
Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

R01 DA042065/DA/NIDA NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Model-based predictions for dopamine

Affiliations

Model-based predictions for dopamine

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources