Randomized Controlled Trial

. 2014 Jun 18;34(25):8594-604.

doi: 10.1523/JNEUROSCI.0277-14.2014.

Altering spatial priority maps via reward-based learning

Leonardo Chelazzi¹, Jana Eštočinová², Riccardo Calletti³, Emanuele Lo Gerfo⁴, Ilaria Sani³, Chiara Della Libera³, Elisa Santandrea³

Affiliations

¹ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Italian Institute of Neuroscience, 37134 Verona, Italy, leonardo.chelazzi@univr.it.
² Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Centre of Applied Computer Science, Pavol Jozef Šafárik University, 04154 Košice, Slovakia, and.
³ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy.
⁴ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Department of Economics, Management and Statistics, University of Milan Bicocca, 20126 Milan, Italy.

PMID: 24948813
PMCID: PMC6608215
DOI: 10.1523/JNEUROSCI.0277-14.2014

Randomized Controlled Trial

Altering spatial priority maps via reward-based learning

Leonardo Chelazzi et al. J Neurosci. 2014.

. 2014 Jun 18;34(25):8594-604.

doi: 10.1523/JNEUROSCI.0277-14.2014.

Authors

Leonardo Chelazzi¹, Jana Eštočinová², Riccardo Calletti³, Emanuele Lo Gerfo⁴, Ilaria Sani³, Chiara Della Libera³, Elisa Santandrea³

Affiliations

¹ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Italian Institute of Neuroscience, 37134 Verona, Italy, leonardo.chelazzi@univr.it.
² Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Centre of Applied Computer Science, Pavol Jozef Šafárik University, 04154 Košice, Slovakia, and.
³ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy.
⁴ Department of Neurological and Movement Sciences, University of Verona, 37134 Verona, Italy, Department of Economics, Management and Statistics, University of Milan Bicocca, 20126 Milan, Italy.

PMID: 24948813
PMCID: PMC6608215
DOI: 10.1523/JNEUROSCI.0277-14.2014

Abstract

Spatial priority maps are real-time representations of the behavioral salience of locations in the visual field, resulting from the combined influence of stimulus driven activity and top-down signals related to the current goals of the individual. They arbitrate which of a number of (potential) targets in the visual scene will win the competition for attentional resources. As a result, deployment of visual attention to a specific spatial location is determined by the current peak of activation (corresponding to the highest behavioral salience) across the map. Here we report a behavioral study performed on healthy human volunteers, where we demonstrate that spatial priority maps can be shaped via reward-based learning, reflecting long-lasting alterations (biases) in the behavioral salience of specific spatial locations. These biases exert an especially strong influence on performance under conditions where multiple potential targets compete for selection, conferring competitive advantage to targets presented in spatial locations associated with greater reward during learning relative to targets presented in locations associated with lesser reward. Such acquired biases of spatial attention are persistent, are nonstrategic in nature, and generalize across stimuli and task contexts. These results suggest that reward-based attentional learning can induce plastic changes in spatial priority maps, endowing these representations with the "intelligent" capacity to learn from experience.

Keywords: cross-target competition; reward-based learning; spatial attention; spatial priority maps.

PubMed Disclaimer

Figures

**Figure 1.**
Experimental procedure. A, Illustration of the baseline and test paradigm. At the beginning of each trial, a fixation display appeared on the screen consisting of an iso-eccentric circular array of eight white squares, marking spatial locations for the upcoming stimuli. After 500 ms, the stimulus array was briefly displayed and was immediately followed by a mask. Participants were to identify one or two targets (letters or digits) among seven or six distractors (nonalphanumeric characters), respectively, by pressing the corresponding key on a standard computer keyboard. B, Illustration of the training paradigm. At the beginning of each trial, a fixation display identical to the one used in the baseline/test task appeared on the screen. After 500 ms, the stimulus array was presented on the screen for 300 ms. Participants were asked to discriminate the color of the upper triangle (either black or white) of the target stimulus as quickly and accurately as possible. Correct responses were followed by a reward feedback, which could be high or low, and the amount gained was indicated at the target location. C, Example of reward contingencies for one participant. The probability of receiving high versus low reward was predetermined and systematically biased on the basis of the specific spatial location in the display, such that each location could be assigned to one of four reward categories: *80Hh*, *50Hh*, *50Lh*, and *20Lh*. In the specific example presented here, the probability of receiving a high reward was biased in favor of the left visual hemifield, which therefore corresponded to the high-reward hemifield for this example subject.

**Figure 2.**
General performance effects. A, Training performance. Average reaction times (mean ± SEM; solid black line) and accuracy of report (mean ± SEM; dashed black line) are reported as a function of subsequent blocks of trials during the training sessions. Block 1 and block 2 belong to the first training session, whereas block 3 and block 4 belong to the second and last training session. B, Baseline performance in the single target condition. Average accuracy of report (%) in the single target condition is illustrated in the polar plot as a function of spatial location and target type. C, Baseline and test performance in the double target condition. The average incidence of different responses (double, single, and null report) across participants is illustrated (mean ± SEM), as assessed during the baseline (leftward stacked bar) and during the test (rightward stacked bar) sessions.

**Figure 3.**
Reward effects. A, Δ probability of report for critical reward-associated spatial locations in the single report condition. The histogram represents the differences in the probability of correctly reporting only the target in a *80Hh* location in *80Hh-20Lh* pairs (left) and the target in a *50Hh* location in *50Hh-50Lh* pairs (right) during the test phase with respect to the same probability in the baseline phase. B, Correlation between Δ reward and Δ probability of report. On the x-axis, the differences in the reward value associated with two targets (Δ reward) in a given pair are reported, corresponding to −60, −30, 0, 30, and 60. For illustrative purposes only, data points with a Δ reward of −30, 0, and 30 were relatively displaced from their actual x-axis coordinate in order for SEM bars not to overlap. The y-axis instead represents the difference in the probability of single reports of a given target in the pair during the test and the same probability during the baseline phase (Δ probability, mean ± SEM). Data points represented by filled squares correspond to Δ probabilities of report calculated for the theoretically “prioritized” spatial location in the pair, whereas data points represented by empty squares correspond to Δ probabilities of report calculated for the target location in the pair for which a disadvantage is expected. The red solid line indicates the fitted linear regression. C, Δ probability of a correct response in the single target condition. The difference in the probability of correctly reporting the target during the test versus baseline session (mean ± SEM) is shown as a function of reward level.

**Figure 4.**
Plasticity of the spatial priority map. The figure illustrates the average priority gain computed for each reward-associated spatial location, both in a 2D plane (middle) and in a 3D representation (top). Bottom, The array of eight spatial locations used in the experiment (Fig. 1A,B), where associated rewards were again spatially arranged on the basis of the example of reward contingencies in Figure 1C. For a given reward level, the average priority gain was calculated by averaging the Δ probability (baseline test difference) of single report for targets at the spatial location associated with that reward level, as separately calculated for all imbalanced-reward pairs containing that reward level. For example, for the two *80H* spatial locations, the priority gain, reported in the map with color coding (middle and top) and as z-axis value (top only), corresponds to the average of the Δ probabilities separately calculated in *80Hh-20Lh* and *80Hh-50 pairs*. In this case, we pooled together *80Hh-50Hh* and *80Hh-50Lh pairs* corresponding to an identical reward imbalance.

See this image and copyright information in PMC

Comment in

Priority maps explain the roles of value, attention, and salience in goal-oriented behavior.
Klink PC, Jentgens P, Lorteije JA. Klink PC, et al. J Neurosci. 2014 Oct 15;34(42):13867-9. doi: 10.1523/JNEUROSCI.3249-14.2014. J Neurosci. 2014. PMID: 25319682 Free PMC article. No abstract available.

References

1. Alvarez GA, Cavanagh P. Independent resources for attentional tracking in the left and right visual hemifields. Psychol Sci. 2005;16:637–643. doi: 10.1111/j.1467-9280.2005.01587.x. - DOI - PubMed
1. Anderson BA. A value-driven mechanism of attentional selection. J Vis. 2013;13:1–16. doi: 10.1167/13.3.7. - DOI - PMC - PubMed
1. Anderson BA, Laurent PA, Yantis S. Value-driven attentional capture. Proc Natl Acad Sci U S A. 2011a;108:10367–10371. doi: 10.1073/pnas.1104047108. - DOI - PMC - PubMed
1. Anderson BA, Laurent PA, Yantis S. Learned value magnifies salience-based attentional capture. PLoS One. 2011b;6:e27926. doi: 10.1371/journal.pone.0027926. - DOI - PMC - PubMed
1. Awh E, Belopolsky AV, Theeuwes J. Top-down versus bottom-up attentional control: a failed theoretical dichotomy. Trends Cogn Sci. 2012;16:437–443. doi: 10.1016/j.tics.2012.06.010. - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Altering spatial priority maps via reward-based learning

Affiliations

Altering spatial priority maps via reward-based learning

Authors

Affiliations

Abstract

Figures

Comment in

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials

Miscellaneous