Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Feb;5(2):e1000284.
doi: 10.1371/journal.pcbi.1000284. Epub 2009 Feb 13.

Can monkeys choose optimally when faced with noisy stimuli and unequal rewards?

Affiliations

Can monkeys choose optimally when faced with noisy stimuli and unequal rewards?

Samuel Feng et al. PLoS Comput Biol. 2009 Feb.

Abstract

We review the leaky competing accumulator model for two-alternative forced-choice decisions with cued responses, and propose extensions to account for the influence of unequal rewards. Assuming that stimulus information is integrated until the cue to respond arrives and that firing rates of stimulus-selective neurons remain well within physiological bounds, the model reduces to an Ornstein-Uhlenbeck (OU) process that yields explicit expressions for the psychometric function that describes accuracy. From these we compute strategies that optimize the rewards expected over blocks of trials administered with mixed difficulty and reward contingencies. The psychometric function is characterized by two parameters: its midpoint slope, which quantifies a subject's ability to extract signal from noise, and its shift, which measures the bias applied to account for unequal rewards. We fit these to data from two monkeys performing the moving dots task with mixed coherences and reward schedules. We find that their behaviors averaged over multiple sessions are close to optimal, with shifts erring in the direction of smaller penalties. We propose two methods for biasing the OU process to produce such shifts.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The motion discrimination task.
Target colors cue the magnitude of rewards for correct responses, red denoting a value twice that of blue. The four panels in the reward segment show the possible reward conditions. See text for full description.
Figure 2
Figure 2. A typical state space of the LCA model, showing nullclines on which for (thin curves), fixed points (filled circles with arrows indicating stability types) and slow manifold (dashed line).
Diagonal solid line represents one-dimensional state space formula image of reduced OU model, with associated probability distribution formula image of sample paths.
Figure 3
Figure 3. Psychometric functions showing fraction of T1 choices as a function of coherence for constant reward bias applied before and during motion period.
(A) formula image; (B) formula image; (C) formula image; each panel shows the cases formula image and −0.1 (left to right). Remaining parameters are formula image and formula image (arbitrary time units). Green lines indicate slopes for zero bias; arrows show shifts.
Figure 4
Figure 4. Optimal shifts as a function of the reward ratio r 1/r 2 for fixed coherences (solid blue curves) and for coherence ranges centered on the fixed coherences (dashed red curves).
(A): formula image = 10; 20 and 30% (top left to bottom right, solid blue), and [C 1;C 2] = [5; 15]; [15; 25] and [25; 35] (top left to bottom right, dashed red). (B): Coherence bands centered on formula image = 20% (solid blue curve) with widths 10; 20; 30 and 40% (bottom left to top right, dashed red). Approximation of Eq. (30) shown in green. The slope b 1 is fixed at 0.06 throughout.
Figure 5
Figure 5. Fits of accuracy data from monkeys A (A) and T (B) to the PMF (15), for the four reward conditions averaged over all sessions.
Bars denote standard errors. See text for details.
Figure 6
Figure 6. Optimal shifts b 2 for a range of reward ratios r 1/r 2 and b 1 = 0.0508 (solid, black) and b 1 = 0.0432 (dot-dashed, red), corresponding to slopes of PMFs fitted to equal rewards data for monkeys A and T.
Vertical dotted lines at r 1/r 2 = 0.5 and 2 intersect the curves at the symmetrically-placed optimal shifts for those reward ratios. (A) Predictions for the different sets of nonuniformly-distributed coherences viewed by each animal. (B) Results for coherences distributed uniformly from −48% to 48%: note smaller optimal shifts and reversal of order of curves for A and T compared to panel A. Triangles and crosses respectively indicate shifts determined from data for monkeys A and T for r 1/r 2 = 0.5, 1 and 2 (cf. Table 1).
Figure 7
Figure 7. Contours (black curves) of expected rewards for for monkeys A (A) and T (B) over the ()-plane, based on the coherences viewed by each animal.
Vertical dashed lines indicate formula image values fitted to pooled equal rewards data. Note that gradients in formula image in either direction away from ridges of maximum expected rewards (blue curves) become smaller as formula image decreases, that gradients are smaller for overshifts in formula image than for undershifts, that this asymmetry increases as formula image decreases, and that gradients are steeper for T than for A. See text for discussion.
Figure 8
Figure 8. Optimal PMFs (black curves) and bands (color) in which 99.5% of maximal possible rewards are gained, compared with session-averaged HL, LL and HH, and LH data (triangles, left to right on each panel) for monkeys A (A) and T (B).
See text for details.
Figure 9
Figure 9. Slope and shift values for individual sessions and the four reward conditions, plotted as points in the for monkeys A (four panels in (A)) and T (four panels in (B)).
Asterisks indicate values averaged over all sessions (cf. top two rows of Table 1). Performance curves and bands show optimal formula image values for given formula image values (central blue curves) and values that gain 99% and 97% of maximum rewards are also shown (flanking magenta curves closest to and farthest from blue curves, respectively).

References

    1. Schall J. Neural basis of deciding, choosing and acting. Nat Rev Neurosci. 2001;2:33–42. - PubMed
    1. Gold J, Shadlen M. Neural computations that underlie decisions about sensory stimuli. Trends Cognitive Sci. 2001;5:10–16. - PubMed
    1. Mazurek M, Roitman J, Ditterich J, Shadlen M. A role for neural integrators in perceptual decision making. Cereb Cortex. 2003;13:891–898. - PubMed
    1. Smith P, Ratcliff R. Psychology and neurobiology of simple decisions. Trends Neurosci. 2004;27:161–168. - PubMed
    1. Britten K, Shadlen M, Newsome W, Movshon J. The analysis of visual motion: A comparison of neuronal and psychophysical performance. J Neurosci. 1992;12:4745–4765. - PMC - PubMed

Publication types