Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan 20;3(1):150523.
doi: 10.1098/rsos.150523. eCollection 2016 Jan.

Sharpening coarse-to-fine stereo vision by perceptual learning: asymmetric transfer across the spatial frequency spectrum

Affiliations

Sharpening coarse-to-fine stereo vision by perceptual learning: asymmetric transfer across the spatial frequency spectrum

Roger W Li et al. R Soc Open Sci. .

Abstract

Neurons in the early visual cortex are finely tuned to different low-level visual features, forming a multi-channel system analysing the visual image formed on the retina in a parallel manner. However, little is known about the potential 'cross-talk' among these channels. Here, we systematically investigated whether stereoacuity, over a large range of target spatial frequencies, can be enhanced by perceptual learning. Using narrow-band visual stimuli, we found that practice with coarse (low spatial frequency) targets substantially improves performance, and that the improvement spreads from coarse to fine (high spatial frequency) three-dimensional perception, generalizing broadly across untrained spatial frequencies and orientations. Notably, we observed an asymmetric transfer of learning across the spatial frequency spectrum. The bandwidth of transfer was broader when training was at a high spatial frequency than at a low spatial frequency. Stereoacuity training is most beneficial when trained with fine targets. This broad transfer of stereoacuity learning contrasts with the highly specific learning reported for other basic visual functions. We also revealed strategies to boost learning outcomes 'beyond-the-plateau'. Our investigations contribute to understanding the functional properties of the network subserving stereovision. The ability to generalize may provide a key principle for restoring impaired binocular vision in clinical situations.

Keywords: generalization; specificity; stereopsis; vision enhancement; visual plasticity.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Stereo stimulus. (a) The stereogram consisted of two slightly different pictures—one to each eye. At the centre each square was a target Gabor patch surrounded by four reference Gabor patches. To eliminate any possible monocular cues, the vertical and horizontal coordinates of each Gabor patch and also the patch features, the carrier phase, were randomly jittered according to a uniform distribution. A custom-built mirror stereoscope was used to view the stereo pairs, so that the left eye would see the left square and the right eye would see the right one. Binocular disparity was generated by shifting the two target Gabor patches, one on each side, horizontally in opposite directions (uncrossed disparity, both shifted temporally; crossed disparity, both shifted nasally). (b) Binocular fusion of the two monocular images creates a cyclopean image. The visual task was to determine the stereoscopic depth of the target Gabor (in front/behind) relative to the four adjacent references. This schematic diagram illustrates crossed disparity—the target Gabor patch appeared in front of the reference patches.
Figure 2.
Figure 2.
Experiment 1. Perceptual learning of stereoacuity: specificity for carrier orientation and spatial frequency. (a) The training protocol consisted of three training stages: stage 1, V5 (vertical carrier: 5 cpd); stage 2, H5 (horizontal carrier: 5 cpd); stage 3, V10 (vertical carrier: 10 cpd). A 3-parameter exponential function was used to quantify the learning profile. Mean thresholds (n=10) for each of the three stimulus configurations were measured before and after each training stage. Error bars indicate the standard error of the mean unless stated otherwise. (b) The pre- and post-training threshold data of individual observers (n=10) are illustrated in the nine figure panels: 1st row, s15 versus s1; 2nd row, s29 versus s15; 3rd row, s43 versus s29. Arrows indicate the sequence of direct training from stage 1 to 3. White panel area denotes statistically significant. Grey panel area, not statistically significant. Note that the abscissa label for each row is displayed at the right-hand end of all the panels highlighted in dark blue.
Figure 3.
Figure 3.
Experiment 2. Bandwidth of generalization across the spatial frequency spectrum. (a) Mean stereoacuity as a function of session. In stage 1, group LH (first row, n=10) was trained with V1 stimuli and group HL (second row, n=11) was trained with V20 stimuli. In stage 2, observers crossed over and trained at the untrained spatial frequency: group LH, V20; group HL, V1. A 3-parameter exponential function was used to quantify the learning profile. (b) Mean stereoacuity across spatial frequencies. (c) Per cent improvement in mean stereoacuity (I) as a function of spatial frequency (f). A 3-parameter Gaussian function, I=It×e−(1/2)(fft/σ)2, was used to quantify the generalization of stereoacuity learning across spatial frequencies, where It is the per cent improvement occurring at the trained spatial frequency (ft) and σ denotes standard deviation.
Figure 4.
Figure 4.
Enhancing coarse-to-fine stereoacuity with perceptual learning. Mean per cent improvement in stereoacuity resulting from direct training as a function of spatial frequency (stage 1 of the two experiments; V5 from the first experiment; V1 and V20 from the second experiment). There was no significant difference in per cent improvement among the three frequency groups (ANOVA: F=0.795, p=0.461).

References

    1. Westheimer G. 1979. The spatial sense of the eye. Invest. Ophthalmol. Vis. Sci. 18, 893–912. - PubMed
    1. Roe AW, Parker AJ, Born RT, DeAngelis GC. 2007. Disparity channels in early vision. J. Neurosci. 27, 11 820–11 831. (doi:10.1523/JNEUROSCI.4164-07.2007) - DOI - PMC - PubMed
    1. Anzai A, DeAngelis GC. 2010. Neural computations underlying depth perception. Curr. Opin. Neurobiol. 20, 367–375. (doi:10.1016/j.conb.2010.04.006) - DOI - PMC - PubMed
    1. Nakatsuka C, Zhang B, Watanabe I, Zheng J, Bi H, Ganz L, Smith EL, Harwerth RS, Chino YM. 2007. Effects of perceptual learning on local stereopsis and neuronal responses of V1 and V2 in prism-reared monkeys. J. Neurophysiol. 97, 2612–2626. (doi:10.1152/jn.01001.2006) - DOI - PubMed
    1. Chowdhury SA, DeAngelis GC. 2008. Fine discrimination training alters the causal contribution of macaque area MT to depth perception. Neuron 60, 367–377. (doi:10.1016/j.neuron.2008.08.023) - DOI - PMC - PubMed