. 2010 Mar 30;107(13):6010-5.

doi: 10.1073/pnas.0912838107. Epub 2010 Mar 15.

The neural code of reward anticipation in human orbitofrontal cortex

Thorsten Kahnt¹, Jakob Heinzle, Soyoung Q Park, John-Dylan Haynes

Affiliations

PMID: 20231475
PMCID: PMC2851854
DOI: 10.1073/pnas.0912838107

The neural code of reward anticipation in human orbitofrontal cortex

Thorsten Kahnt et al. Proc Natl Acad Sci U S A. 2010.

. 2010 Mar 30;107(13):6010-5.

doi: 10.1073/pnas.0912838107. Epub 2010 Mar 15.

Authors

Thorsten Kahnt¹, Jakob Heinzle, Soyoung Q Park, John-Dylan Haynes

Affiliation

¹ Bernstein Center for Computational Neuroscience, Charité-Universitätsmedizin Berlin, 10115 Berlin, Germany. kahnt@bccn-berlin.de

PMID: 20231475
PMCID: PMC2851854
DOI: 10.1073/pnas.0912838107

Abstract

An optimal choice among alternative behavioral options requires precise anticipatory representations of their possible outcomes. A fundamental question is how such anticipated outcomes are represented in the brain. Reward coding at the level of single cells in the orbitofrontal cortex (OFC) follows a more heterogeneous coding scheme than suggested by studies using functional MRI (fMRI) in humans. Using a combination of multivariate pattern classification and fMRI we show that the reward value of sensory cues can be decoded from distributed fMRI patterns in the OFC. This distributed representation is compatible with previous reports from animal electrophysiology that show that reward is encoded by different neural populations with opposing coding schemes. Importantly, the fMRI patterns representing specific values during anticipation are similar to those that emerge during the receipt of reward. Furthermore, we show that the degree of this coding similarity is related to subjects' ability to use value information to guide behavior. These findings narrow the gap between reward coding in humans and animals and corroborate the notion that value representations in OFC are independent of whether reward is anticipated or actually received.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Fig. 1.**
Experimental design and behavioral results. (A) After presentation of a sensory cue, subjects had to judge either the main rotation direction or the color of the dots (randomized). The reward outcome was delivered after correct responses. (B) Sensory cues consisted of colored rotating dots that were associated with reward value in a logical XOR fashion. The combination of rotation direction and color was reward predicting, whereas color and rotation direction alone were not informative about the outcome. Two stimulus combinations that do not share any sensory properties predict high rewards (e.g., G&CW and R&CCW) and two predict low rewards (e.g., G&CCW and R&CW). CW, clockwise; CCW, counterclockwise. An example pairing is shown here (the actual parings were counterbalanced across subjects). All 16 cells were presented once in each run. To compensate for the overrepresentation of intermediate values, the four extreme values were presented two additional times each (16 + 2 × 4 = 24 trials). (C) Subjective ratings from a postscanning rating session increased as a function of reward value, suggesting that subjects indeed were aware of the link between cues and reward levels. All differences are significant (P < 0.001, Bonferroni corrected). Error bars for SEM are smaller than the symbols.

**Fig. 2.**
Multivariate pattern classification. (A) Cues were sorted into four groups according to their sensory properties. Two cues predicted a high reward (7 and 10 points), whereas the remaining two predicted a low reward (0 and 3 points). (B) A support vector classifier (SVC) was trained on "training" data from nine scanning runs to classify fMRI patterns evoked by one specific pair of low- vs. high-value cues (e.g., G&CCW vs. R&CCW). From the remaining test data set (run 10), fMRI patterns to cues that also predicted low and high values but had different sensory properties were used to test the performance of the SVC (e.g., R&CW vs. G&CW). In total, this procedure was performed on four different training-test pairs each time as a 10-fold leave-one-out cross-validation. (C) We searched in every local cluster of brain activity for information about the reward value during anticipation using a searchlight approach (37, 38). For every voxel in the brain, the fMRI patterns in the local cluster surrounding this voxel were extracted for each cue and each scanning run separately. Then the decoding procedure described in B was performed on that data.

**Fig. 3.**
Decoding of reward value during anticipation. (A) Distributed fMRI patterns in the medial OFC [MNI coordinates: (3, 33, −6), t = 6.90] and the ventral striatum [VS (6, 6, −6), t = 5.65] represent the value of anticipated outcomes independent of the sensory properties of the cues. T map based on the decoding accuracies of all four training-test pairs is thresholded at P < 0.05, FWE whole-brain corrected with a cluster extent threshold of k = 30 voxels, and overlaid on a normalized T1-weighted image averaged across subjects. (B) Bar graphs show average decoding accuracy across subjects (% correct classified, chance level is 50%) for the different training-test pairs (nos. 1–4; see Fig. 2B) and error bars depict SEM. Please note that the decoding accuracy only provides a lower bound on information. The predictive accuracy at the level of populations of single cells could potentially be substantially higher if only a subpopulation of cells is modulated by reward, as suggested by electrophysiological studies in primates (10, 24, 26).

**Fig. 4.**
Similar value-coding fMRI patterns in the OFC during anticipation and receipt of reward. (A) In the medial OFC [MNI coordinates: (3, 54, −15), t = 6.20] similar fMRI patterns represent value during both anticipation and receipt of reward. The t map based on decoding accuracies from both training-test pairs is thresholded at P < 0.05 (FWE whole-brain corrected; cluster extent threshold k = 30 voxels) and overlaid on a normalized T1-weighted image averaged across subjects. (B) The surface plot depicts voxel selectivities (support vector weights, SV weights) in the spherical cluster surrounding the individual peak voxel in medial OFC for one subject. The selectivity of each voxel for either low or high values is color coded in blue and yellow, respectively. (*Left*) SV weights from the SVC trained on fMRI patterns during anticipation and (*Right*) during receipt of reward. Scatter plot in the middle illustrates the similarity between the voxel selectivities during anticipation (x axis) and receipt of reward (y axis). 3D patterns from all subjects are shown in Figs. S3 and S4. (C) Significant relationship (r = 0.51, P < 0.05) between the correlation of the fMRI patterns in the medial OFC during anticipation and receipt of reward (pattern similarity, x axis) and the coefficients of determination (R²) describing the subjective association between the sensory cue and reward value (subjective association, y axis) obtained from the postscanning ratings. There was no significant relationship in the mPFC (P = 0.65) or the dACC cluster (P = 0.09). (D) Significant relationship (r = 0.61, P < 0.05) between pattern similarity in OFC (x axis) and the modulation of performance (% correct) by expected value (y axis) during the task. There was no significant relationship in the mPFC (P = 0.14) or the dACC (P = 0.29).

See this image and copyright information in PMC

Cited by

The neural basis of temporal individuation and its capacity limits in the human brain.
Naughtin CK, Tamber-Rosenau BJ, Dux PE. Naughtin CK, et al. J Neurophysiol. 2014 Feb;111(3):499-512. doi: 10.1152/jn.00534.2013. Epub 2013 Nov 6. J Neurophysiol. 2014. Retraction in: J Neurophysiol. 2016 Nov 1;116(5):2467. doi: 10.1152/jn.z9k-3963-retr.2016. PMID: 24198320 Free PMC article. Retracted.
BOLD subjective value signals exhibit robust range adaptation.
Cox KM, Kable JW. Cox KM, et al. J Neurosci. 2014 Dec 3;34(49):16533-43. doi: 10.1523/JNEUROSCI.3927-14.2014. J Neurosci. 2014. PMID: 25471589 Free PMC article.
The neural basis of temporal individuation and its capacity limits in the human brain.
Naughtin CK, Tamber-Rosenau BJ, Dux PE. Naughtin CK, et al. J Neurophysiol. 2017 Nov 1;118(5):2601-2613. doi: 10.1152/jn.00839.2016. Epub 2017 Aug 30. J Neurophysiol. 2017. PMID: 28855297 Free PMC article.
Effect of carbonated water on cerebral blood flow in the frontal region: a study using near-infrared spectroscopy.
Kosugi W, Sumali B, Hamada N, Mitsukura Y. Kosugi W, et al. Front Behav Neurosci. 2024 Dec 10;18:1409123. doi: 10.3389/fnbeh.2024.1409123. eCollection 2024. Front Behav Neurosci. 2024. PMID: 39720307 Free PMC article.
Differential maturation of the brain networks required for the sensory, emotional, and cognitive aspects of pain in human newborns.
Jones L, Batalle D, Meek J, Edwards AD, Fitzgerald M, Arichi T, Fabrizi L. Jones L, et al. Pain. 2025 Jun 18:10.1097/j.pain.0000000000003619. doi: 10.1097/j.pain.0000000000003619. Online ahead of print. Pain. 2025. PMID: 40532739 Free PMC article.

See all "Cited by" articles

References

1. Rolls ET. The orbitofrontal cortex and reward. Cereb Cortex. 2000;10:284–294. - PubMed
1. Kringelbach ML. The human orbitofrontal cortex: Linking reward to hedonic experience. Nat Rev Neurosci. 2005;6:691–702. - PubMed
1. Murray EA, O'Doherty JP, Schoenbaum G. What we know and do not know about the functions of the orbitofrontal cortex after 20 years of cross-species studies. J Neurosci. 2007;27:8166–8169. - PMC - PubMed
1. O'Doherty JP. Lights, camembert, action! The role of human orbitofrontal cortex in encoding stimuli, rewards and choices. Ann N Y Acad Sci. 2007;1121:254–272. - PubMed
1. Schoenbaum G, Saddoris MP, Stalnaker TA. Reconciling the roles of orbitofrontal cortex in reversal learning and the encoding of outcome expectancies. Ann N Y Acad Sci. 2007;1121:320–335. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The neural code of reward anticipation in human orbitofrontal cortex

Affiliation

The neural code of reward anticipation in human orbitofrontal cortex

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Molecular Biology Databases

Miscellaneous