. 2007 Sep;47(20):2585-602.

doi: 10.1016/j.visres.2007.07.003. Epub 2007 Aug 14.

Solving da Vinci stereopsis with depth-edge-selective V2 cells

Andrew Assee¹, Ning Qian

Affiliations

Affiliation

¹ Center for Neurobiology and Behavior and Department of Physiology and Cellular Biophysics, Columbia University, 1051 Riverside Drive, Box 87, Kolb Research Annex, Room 519, New York, NY 10032, USA. ada2007@columbia.edu

PMID: 17698163
PMCID: PMC2086864
DOI: 10.1016/j.visres.2007.07.003

Solving da Vinci stereopsis with depth-edge-selective V2 cells

Andrew Assee et al. Vision Res. 2007 Sep.

. 2007 Sep;47(20):2585-602.

doi: 10.1016/j.visres.2007.07.003. Epub 2007 Aug 14.

Authors

Andrew Assee¹, Ning Qian

Affiliation

¹ Center for Neurobiology and Behavior and Department of Physiology and Cellular Biophysics, Columbia University, 1051 Riverside Drive, Box 87, Kolb Research Annex, Room 519, New York, NY 10032, USA. ada2007@columbia.edu

PMID: 17698163
PMCID: PMC2086864
DOI: 10.1016/j.visres.2007.07.003

Abstract

We propose a new model for da Vinci stereopsis based on a coarse-to-fine disparity energy computation in V1 and disparity-boundary-selective units in V2. Unlike previous work, our model contains only binocular cells, relies on distributed representations of disparity, and has a simple V1-to-V2 feedforward structure. We demonstrate with random-dot stereograms that the V2 stage of our model is able to determine the location and the eye-of-origin of monocularly occluded regions, and improve disparity map computation. We also examine a few related issues. First, we argue that since monocular regions are binocularly defined, they cannot generally be detected by monocular cells. Second, we show that our coarse-to-fine V1 model for conventional stereopsis explains double matching in Panum's limiting case. This provides computational support to the notion that the perceived depth of a monocular bar next to a binocular rectangle may not be da Vinci stereopsis per se [Gillam, B., Cook, M., & Blackburn, S. (2003). Monocular discs in the occlusion zones of binocular surfaces do not have quantitative depth--a comparison with Panum's limiting case. Perception 32, 1009-1019.]. Third, we demonstrate that some stimuli previously deemed invalid have simple, valid geometric interpretations. Our work suggests that studies of da Vinci stereopsis should focus on stimuli more general than the bar-and-rectangle type and that disparity-boundary-selective V2 cells may provide a simple physiological mechanism for da Vinci stereopsis.

PubMed Disclaimer

Figures

**Figure 1**
Occlusion geometry and the role of monocular vs. binocular cells in solving da Vinci stereopsis. (A) Schematic diagram of a scene where a near surface occludes a background. The dotted lines indicate the extent to which the near surface occludes the far surface from each eye. (B) Images seen by the left and right eyes for the scene in (A) when fixation is at the near surface. (C) A special case of (B) when the binocular background is assumed to be featureless. For all panels, gray squares indicate binocular regions, and black and white squares represent left- and right-eye-only monocular regions, respectively. In (B) and (C), the dotted lines indicate correspondence between two eyes’ images. Ovals indicate the RFs of monocular cells, with dashed and solid lines representing left- and right-eye-only RFs, respectively. The vertical dimension of RFs is not shown to scale. Only for the special case in (C) can the relative responses of the left-and right-eye-only monocular cells determine the monocular regions.

**Figure 2**
Stimuli used by Nakayama and Shimojo (1990). Each stimulus is composed of a binocular rectangle and monocular bar. Nakayama and Shimojo (1990) classified the four possible configurations into two valid (A and B) and two invalid (C and D) cases. Redrawn from Nakayama and Shimojo (1990).

**Figure 3**
Schematic representation of the V1–V2 circuitry in our model. (A) An example V2 cell that receives inputs from two V1 cells with a preferred disparity of 2 pixels to the left and two V1 cells with a preferred disparity of 0 pixels to the right. The farther of the two preferred disparities, 2 pixels in this example, is termed the cell’s da Vinci disparity. (B) The tripartite receptive field of the model V2 cell in (A).

**Figure 4**
Three types of population responses of our model V2 cells. Each plot shows the responses of all V2 cells for a given image location. The peak response is below the diagonal line (A), on the diagonal line (B), or above the diagonal line (C). These three types of population responses indicate a left-eye-only monocular region, a binocular region, and a right-eye-only monocular region, respectively (see text for a detailed explanation).

**Figure 5**
Modeling double matching in Panum’s limiting case. (A) Psychophysical data of one subject reprinted from Gillam et al. (2003) with permission from Pion Limited, London. The perceived disparity of the monocular line is plotted as a function of the separation between the monocular and binocular lines. (B) Analytical results for the single-scale disparity energy model plotted for four scales with σ = 2, 4, 8, and 16 min. (C) Simulation results from the multi-scale, coarse-to-fine disparity energy model.

**Figure 6**
Valid geometric interpretations for both valid and “invalid” stimuli in Fig. 2. (A) Scene that generates the two valid cases in Fig. 2. (B) The retinal half-images of the scene in (A). (C) Scene that generates the two “invalid” cases in Fig. 2. (D) Another scene that generates the two “invalid” cases in Fig. 2. (E) The retinal half-images of the scenes in (C) and (D). Rectangles delineated by dotted lines in A, C and D represent featureless opaque surfaces. Other figure conventions are the same as in Fig. 1.

**Figure 7**
A valid geometric interpretation for an “invalid” random-dot stimulus in Shimojo and Nakayama (1990). The scene depicted in (A) generates the half-images in (B) used in Shimojo and Nakayama (1990). Figure conventions are the same as in Fig. 1.

**Figure 8**
Ocularity (eye-of-origin) maps computed at the V2 stage of our model for two random dot stereograms. (A) Simulation results from a stereogram whose middle third had a near disparity of −4 pixels and the remaining two thirds had a 0 disparity. (B) Simulation results from a stereogram whose middle third had a far disparity of 4 pixels and the remaining two thirds had a 0 disparity. In each part, the top panel shows the ideal ocularity map, with gray color representing binocular regions, and black and white colors representing left- and right-eye-only monocular regions, respectively. The bottom panel shows the raw computed ocularity map where a continuous gray scale represents the normalized difference between the two preferred disparities of the most responsive cell at each image location. Gray color represents 0 difference, and black and white colors represent negative and positive differences, respectively. A threshold can be used to convert each raw map shown here into a specific ocularity map for comparison with the corresponding ideal map.

**Figure 9**
Disparity maps computed at both the V1 and V2 stages of our model for the same random dot stereograms used in Fig. 8. (A) Results from the stereogram whose middle third has a near disparity of −4 pixels. (B) Results from the stereogram whose middle third has a far disparity of 4 pixels. In each part, the first and second rows show the V1 and V2 results, respectively. Plots in the first column show disparity as a function of the horizontal image position. The black traces are the ideal disparity maps; the monocular regions (marked by dotted vertical lines) take the disparity values of the background surfaces. The gray traces are the computed disparities. Plots in the second column are gray scale representation of computed disparities as a function of both horizontal and vertical image positions.

**Figure 10**
Effect of varying the number of V1 cells connected to a V2 cell. The percent error improvement is the difference between the V1 error (in disparity map computation) and the V2 error divided by the V1 error, averaged over 10 stereograms. Circles and squares represent results for stimuli with −4 (near) and 4 (far) pixels of disparity, respectively.

**Figure 11**
Simulations of disparity-boundary-selectivity in V2. (A) Recordings of two real disparity-boundary-selective V2 cells reprinted from (von der Heydt et al., 2000) with permission from Elsevier. Responses are shown as a function of the relative position between the cells’ RFs (small rectangles) and the central figures (large rectangles) of random dot stereograms. Each cell is tuned to a specific disparity edge in the stereograms. (B) Disparity-boundary-selectivity for two of our model V2 cells. (C) A half-maximum threshold is applied to (B) to eliminate the intermediate responses.

**Figure 12**
Simulations of two-dimensional disparity tuning of disparity-boundary-selective V2 cells. (A) Recordings of two real disparity-boundary-selective V2 cells reprinted from (von der Heydt et al., 2000) with permission from Elsevier. The RF of each cell was always aligned with a disparity boundary in a random dot stereogram (shown in the schematic to the right in the top row). Each plot shows the responses of a single V2 cell to various combinations of the foreground and background disparities. (B) Disparity tuning for two of our model V2 cells. (C) A half-maximum threshold is applied to (B) to eliminate the intermediate responses.

**Figure 13**
Ocularity map and disparity map computations of our model with the half-maximum threshold used in Figs. 11 and 12. The stereogram is the same as that in Figs. 8A and 9A, with a near disparity of −4 pixels for the middle third of the image. (A) Computed ocularity map at the V2 stage of the model. (B) Computed disparity maps at the V1 and V2 stages of the model. The figure conventions are the same as in Figs. 8 and 9.

See this image and copyright information in PMC

Cited by

Natural statistics of depth edges modulate perceptual stability.
Basgöze Z, White DN, Burge J, Cooper EA. Basgöze Z, et al. J Vis. 2020 Aug 3;20(8):10. doi: 10.1167/jov.20.8.10. J Vis. 2020. PMID: 32761107 Free PMC article.
First- and second-order contributions to depth perception in anti-correlated random dot stereograms.
Asher JM, Hibbard PB. Asher JM, et al. Sci Rep. 2018 Sep 20;8(1):14120. doi: 10.1038/s41598-018-32500-4. Sci Rep. 2018. PMID: 30237535 Free PMC article.
Synchronized audio-visual transients drive efficient visual search for motion-in-depth.
Zannoli M, Cass J, Mamassian P, Alais D. Zannoli M, et al. PLoS One. 2012;7(5):e37190. doi: 10.1371/journal.pone.0037190. Epub 2012 May 17. PLoS One. 2012. PMID: 22615939 Free PMC article.
Unmixing binocular signals.
Lehky SR. Lehky SR. Front Hum Neurosci. 2011 Aug 9;5:78. doi: 10.3389/fnhum.2011.00078. eCollection 2011. Front Hum Neurosci. 2011. PMID: 21886618 Free PMC article.
Binocular vision.
Blake R, Wilson H. Blake R, et al. Vision Res. 2011 Apr 13;51(7):754-70. doi: 10.1016/j.visres.2010.10.009. Epub 2010 Oct 15. Vision Res. 2011. PMID: 20951722 Free PMC article. Review.

See all "Cited by" articles

References

1. Anderson BL. The role of partial occlusion in stereopsis. Nature. 1994;367:365–368. - PubMed
1. Assee A, Qian N. Solving da Vinci Stereopsis with V2 Disparity-Boundary-Selective Cells. Society for Neuroscience Meeting 2006
1. Burkhalter A, Van Essen DC. Processing of color, form and disparity information in visual areas VP and V2 of ventral extrastriate cortex in the macaque monkey. J Neurosci. 1986;6:2327–2351. - PMC - PubMed
1. Cao Y, Grossberg S. A laminar cortical model of stereopsis and 3D surface perception: closure and da Vinci stereopsis. Spat Vis. 2005;18:515–578. - PubMed
1. Chen Y, Qian N. A coarse-to-fine disparity energy model with both phase-shift and position-shift receptive field mechanisms. Neural Comput. 2004;16:1545–1577. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Solving da Vinci stereopsis with depth-edge-selective V2 cells

Affiliation

Solving da Vinci stereopsis with depth-edge-selective V2 cells

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources