Review

. 2011 Dec:1239:87-99.

doi: 10.1111/j.1749-6632.2011.06210.x.

Does the orbitofrontal cortex signal value?

Geoffrey Schoenbaum¹, Yuji Takahashi, Tzu-Lan Liu, Michael A McDannald

Affiliations

PMID: 22145878
PMCID: PMC3530400
DOI: 10.1111/j.1749-6632.2011.06210.x

Review

Does the orbitofrontal cortex signal value?

Geoffrey Schoenbaum et al. Ann N Y Acad Sci. 2011 Dec.

. 2011 Dec:1239:87-99.

doi: 10.1111/j.1749-6632.2011.06210.x.

Authors

Geoffrey Schoenbaum¹, Yuji Takahashi, Tzu-Lan Liu, Michael A McDannald

Affiliation

¹ Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA. gscho002@umaryland.edu

PMID: 22145878
PMCID: PMC3530400
DOI: 10.1111/j.1749-6632.2011.06210.x

Abstract

The orbitofrontal cortex (OFC) has long been implicated in associative learning. Early work by Mishkin and Rolls showed that the OFC was critical for rapid changes in learned behavior, a role that was reflected in the encoding of associative information by orbitofrontal neurons. Over the years, new data-particularly neurophysiological data-have increasingly emphasized the OFC in signaling actual value. These signals have been reported to vary according to internal preferences and judgments and to even be completely independent of the sensory qualities of predictive cues, the actual rewards, and the responses required to obtain them. At the same time, increasingly sophisticated behavioral studies have shown that the OFC is often unnecessary for simple value-based behavior and instead seems critical when information about specific outcomes must be used to guide behavior and learning. Here, we review these data and suggest a theory that potentially reconciles these two ideas, value versus specific outcomes, and bodies of work on the OFC.

PubMed Disclaimer

Figures

**Figure 1**
OFC contribution to Pavlovian overexpectation, adapted from Takahashi *et al*. Shown is the experimental timeline linking conditioning, compound conditioning, and probe phases to data from each phase. Top and bottom rows of plots indicate control (saline infusion into the OFC) and OFCi group (muscimol + baclofen infusion into the OFC), respectively. In timeline and plots, V1 is a visual cue (a cue light); A1, A2, and A3 are auditory cues (tone, white noise, and clicker, counterbalanced), and O1 and O2 are different flavored sucrose pellets (banana and grape, counterbalanced). Position of the cannula within OFC in saline controls (gray dot) and OFCi (black dot) rats are shown beneath the timeline. (A) Percentage of individuals responding to food cup during cue presentation across 10 days of conditioning. Gray, black, and white squares indicate A1, A2, and A3 cues, respectively. (B) Percentage of those responding to food cup during cue presentation across four days of compound training. Gray, black, and white squares indicate A1/V1, A2, and A3 cues, respectively. Gray and black bars in the insets indicate average normalized percentage responding to A1/V1 and A2, respectively. (C) Percentage of those responding to food cup during cue presentation in the probe test. Line graph shows those responding across the eight trials, and the bar graph shows average number responding in these eight trials. Gray, black, and white colors indicate A1, A2, and A3 cues, respectively. * P < 0.05 and ** P < 0.01 on *post hoc* contrast testing. NS, not significant. Error bars denote SEM.

**Figure 2**
OFC contribution to reinforcer-specific Pavlovian to instrumental transfer adapted from Ostlund *et al*. Pavlovian training consisting of pairing two different auditory cues with either sucrose or pellet rewards. Sham or OFC lesions were given before or following Pavlovian training. Rats were trained that a right lever press produced pellets while a left lever press produced sucrose solution (counterbalanced). Each rat was tested twice, each with one lever present. Mean lever-press rates are shown during a cue-free baseline period (white bars), a cue signaling the same outcome as the lever press (black bar), and a cue signaling a different outcome (gray bar). *Pre* means Sham or OFC lesion was given before Pavlovian conditioning; *post* means Sham or OFC lesion was given following Pavlovian conditioning. Error bars denote SEM.

**Figure 3**
OFC contribution to conditioned reinforcement adapted from Burke *et al*. Shown is the experimental timeline linked to data from each conditioned reinforcement (CRf) test. In the timeline and figures, A, B, X, and Y are training cues; R1 and R2 are instrumental responses; and O1 and O2 are different flavored sucrose pellet reinforcers. Normal (open bars) and OFC-lesioned rats (black bars) were first trained to associate two different auditory cues (A and B) with two differently flavored sugar pellets (banana and grape). After these associations were learned, an “unblocking” phase occurred in which A was compounded with a novel cue, and X and B compounded with a novel cue Y. While AX predicted the original flavor, BY predicted another flavor. This arrangement has been shown to “block” learning to X but “unblock” learning to Y. (A–D) Lever pressing for A versus X, or Y versus X, in control (open bars) and lesioned (filled bars) rats before (A, B) and after (C, D) devaluation. Lesions diminished responding for Y before devaluation (A, B); controls diminished lever pressing for Y after devaluation (C, D). Lever pressing is averaged across two 30-min sessions in each figure. Asterisks indicate significance at P < 0.05 on *post hoc* contrast testing; the gray numbers indicate the ratio of responding on the two levers for each significant comparison. NS, not significant. Error bars denote SD.

**Figure 4**
OFC contribution to differential outcome expectancy; adapted from McDannald *et al*. (A) Sham and OFC-lesioned rats were required to learn the following stimulus-action-outcome associations: S1 → R1 → O1, S2 → R2 → O2, and S3 → R3 → O1/O2. S1–3 were different auditory cues, R1–3 different operant responses, and O1,2 different flavors of sucrose. Performance during S1, in which R1 responses were always reinforced with O1, and errors, consisting of R2 or R3 responses, were not reinforced. R2 responses were always reinforced with a different outcome (O2) on S2 trials, whereas R3 was reinforced with the same outcome (O1) as R1 on one-half of the S3 presentations and with O2 on the other half of the S3 trials. (B) Responding during S1 trials is shown. Sham rats showed more shared-outcome errors (R3) than different-outcome errors (R2) during S1. This result can be attributed to the rats' use of specific outcome expectancies to guide responding: responses reinforced in the presence of different expectancies (e.g., R1 and R2) were differentiated more readily than responses that yielded the same outcome (e.g., R1 and R3). In contrast, OFC-lesioned rats showed equivalent levels of shared- and different-outcome errors. The absence of any difference in shared- and different-outcome errors in these rats is consistent with impairment in the associative basis for outcome expectancy learning. Error bars denote SEM.

**Figure 5**
OFC contribution to value and identity unblocking; adapted from McDannald *et al*. (A) Rats were first trained to associate three different visual cues (A, B, C) with three different amounts (3 or 1) and identities (O1 or O2; banana-flavored sugar pellets or grape-flavored sugar pellets) of reward: A → 3xO1, B → 3xO2, and C → O1. Following this phase an unblocking phase was given in which compounds of the original cues and three novel auditory cues, AX, BY and CZ, all predicted 3xO1. In this way X signaled the expected reward (Blocked), Y signaled a differently flavored but similarly valued reward (Identity) and Z signaled a differently valued but similarly flavored reward (Value). (B) Food cup responding to the Blocked cue (which signaled the expected number of pellets) and the Value cue (which signaled greater than expected reinforcement) for Control and OFC-lesioned rats. (C) Food cup responding to the Blocked cue (which signaled the expected flavor of pellets) and the Identity cue (which signaled an equally preferred but differently flavored pellet) for Control and OFC-lesioned rats. (D) The relationship between value sensitivity to cues A and B (which predicted equal amounts of different-flavored sugar pellets) in initial conditioning and the display of identity unblocking for Control and OFC-lesioned rats. Asterisks indicate significance at P < 0.05 on *post hoc* contrast testing. NS, not significant. Error bars denote SEM.

See this image and copyright information in PMC

Cited by

Reinforcement learning models and their neural correlates: An activation likelihood estimation meta-analysis.
Chase HW, Kumar P, Eickhoff SB, Dombrovski AY. Chase HW, et al. Cogn Affect Behav Neurosci. 2015 Jun;15(2):435-59. doi: 10.3758/s13415-015-0338-7. Cogn Affect Behav Neurosci. 2015. PMID: 25665667 Free PMC article. Review.
Imbalanced Activity in the Orbitofrontal Cortex and Nucleus Accumbens Impairs Behavioral Inhibition.
Meyer HC, Bucci DJ. Meyer HC, et al. Curr Biol. 2016 Oct 24;26(20):2834-2839. doi: 10.1016/j.cub.2016.08.034. Epub 2016 Sep 29. Curr Biol. 2016. PMID: 27693139 Free PMC article.
Foundations of human spatial problem solving.
Zarr N, Brown JW. Zarr N, et al. Sci Rep. 2023 Jan 27;13(1):1485. doi: 10.1038/s41598-023-28834-3. Sci Rep. 2023. PMID: 36707649 Free PMC article.
A Literature Review on the Efficacy and Related Neural Effects of Pharmacological and Psychosocial Treatments in Individuals With Internet Gaming Disorder.
Seo EH, Yang HJ, Kim SG, Park SC, Lee SK, Yoon HJ. Seo EH, et al. Psychiatry Investig. 2021 Dec;18(12):1149-1163. doi: 10.30773/pi.2021.0207. Epub 2021 Dec 8. Psychiatry Investig. 2021. PMID: 34872237 Free PMC article.
Transfer of learning relates to intrinsic connectivity between hippocampus, ventromedial prefrontal cortex, and large-scale networks.
Gerraty RT, Davidow JY, Wimmer GE, Kahn I, Shohamy D. Gerraty RT, et al. J Neurosci. 2014 Aug 20;34(34):11297-303. doi: 10.1523/JNEUROSCI.0185-14.2014. J Neurosci. 2014. PMID: 25143610 Free PMC article.

See all "Cited by" articles

References

1. Jones B, Mishkin M. Limbic lesions and the problem of stimulus-reinforcement associations. Exp. Neurol. 1972;36:362–377. - PubMed
1. Butter CM. Perseveration and extinction in discrimination reversal tasks following selective frontal ablations in Macaca mulatta. Physiol. Behav. 1969;4:163–171.
1. Rolls ET, et al. Emotion-related learning in patients with social and emotional changes associated with frontal lobe damage. J. Neurol. Neurosurg. Psychiatr. 1994;57:1518–1524. - PMC - PubMed
1. Meunier M, Bachevalier J, Mishkin M. Effects of orbital frontal and anterior cingulate lesions on object and spatial memory in rhesus monkeys. Neuropsychologia. 1997;35:999–1015. - PubMed
1. McAlonan K, Brown VJ. Orbital prefrontal cortex mediates reversal learning and not attentional set shifting in the rat. Behav. Brain Res. 2003;146:97–130. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Does the orbitofrontal cortex signal value?

Affiliation

Does the orbitofrontal cortex signal value?

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources