Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Dec;26(12):1119-1132.
doi: 10.1016/j.tics.2022.09.019. Epub 2022 Oct 19.

Does the brain's ventral visual pathway compute object shape?

Affiliations
Review

Does the brain's ventral visual pathway compute object shape?

Vladislav Ayzenberg et al. Trends Cogn Sci. 2022 Dec.

Abstract

A rich behavioral literature has shown that human object recognition is supported by a representation of shape that is tolerant to variations in an object's appearance. Such 'global' shape representations are achieved by describing objects via the spatial arrangement of their local features, or structure, rather than by the appearance of the features themselves. However, accumulating evidence suggests that the ventral visual pathway - the primary substrate underlying object recognition - may not represent global shape. Instead, ventral representations may be better described as a basis set of local image features. We suggest that this evidence forces a reevaluation of the role of the ventral pathway in object perception and posits a broader network for shape perception that encompasses contributions from the dorsal pathway.

Keywords: deep neural networks; dorsal stream; object recognition; shape perception; ventral stream; viewpoint-invariance.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests No interests are declared.

Figures

Figure 1.
Figure 1.. Object images with intact and disrupted global shapes.
(A) Examples of airplanes where the spatial arrangement of features is preserved. Despite radically different local features, a common spatial arrangement, or structure, elicits the same percept of shape. (B) Examples of airplanes where the spatial arrangement of features is disrupted or partially occluded via foreshortening.
Figure 2.
Figure 2.. Slight image perturbations change the firing rate of individual ventral neurons.
(A) Example of a clean preferred (red) and non-preferred (blue) image categories for an individual neuron, as well as non-preferred images with various degrees of perturbation as specified by ε. (B) Raster plot illustrating the firing rate of the neuron in response to its preferred category (red), as well as to a non-preferred category (blue) following different degrees of perturbation. (C) The normalized (norm.) firing rate to preferred images (dashed red line) and non-preferred images (solid blue line) following different degrees of perturbation. By ε = 10, the firing to the non-preferred image category exceeds the preferred image category. Figure adapted, with permission, from [58].
Figure 3.
Figure 3.. Greater sensitivity to local features than global shape in the ventral pathway.
(A) Example stimuli and results from [60] as measured behaviorally from human observers (top), the human ventral visual pathway (middle), and deep neural networks (DNNs, bottom). Each image triplet depicts the multidimensional scaling (MDS) of image similarities. The top two images of each triplet depict synthetic images where the arrangement of features has been scrambled, and the bottom image of each triplet depicts the original image where the arrangement of features remains intact. The distances between each image in a triplet reflect their similarities. (Top) Human observers readily grouped feature scrambled images together and discriminated them from intact images. By contrast, the multivariate responses of the (middle) ventral pathway and (bottom) DNNs showed no such grouping, with equal distances between each image. (B) Example stimuli and results from [61]. (Top) Texform examples are shown alongside their real-world counterparts for animate/inanimate and large/small categories. (Bottom) Ventral pathway activation maps for the original images and their texform counterparts. Although unrecognizable by human observers, texforms elicit the same large-scale topographic organization along the dimensions of object animacy and size as real-world objects.
Figure 4.
Figure 4.. An expanded brain network for object recognition.
In this schematic depiction of the visual system, the ventral pathway (V1 to ATL) acts much like a DNN (bottom) – extracting increasingly complex local object features, but not a complete shape. Instead, structural information describing the global shape of an object, but not its individual features (top; depicted as a red skeleton), may be computed in dorsal visual pathway regions such as IPS. This information is then sent to the ventral pathway to form a complete object representation. Abbreviations: ATL, anterior temporal lobe; DNN, deep neural network; IPS, intraparietal sulcus; L1–L6, layers 1–6; LOC, lateral occipital complex; V1–V4, visual areas 1–4.

References

    1. Wagemans J et al. (2008) Identification of everyday objects on the basis of silhouette and outline versions. Perception 37, 207–244 - PubMed
    1. Elder JH and Velisavljević L (2009) Cue dynamics underlying rapid detection of animals in natural scenes. J. Vision 9, 1–20 - PubMed
    1. Biederman I and Ju G (1988) Surface versus edge-based determinants of visual recognition. Cogn. Psychol. 20, 38–64 - PubMed
    1. Ayzenberg V and Lourenco S (2022) Perception of an objecťs global shape is best described by a model of skeletal structure in human infants. Elife 11, e74943. - PMC - PubMed
    1. Quinn PC et al. (2001) Perceptual categorization of cat and dog silhouettes by 3-to 4-month-old infants. J. Exp. Child Psychol. 79, 78–94 - PubMed

Publication types

LinkOut - more resources