Mid-level visual features underlie the high-level categorical organization of the ventral stream
- PMID: 30171168
- PMCID: PMC6156638
- DOI: 10.1073/pnas.1719616115
Mid-level visual features underlie the high-level categorical organization of the ventral stream
Abstract
Human object-selective cortex shows a large-scale organization characterized by the high-level properties of both animacy and object size. To what extent are these neural responses explained by primitive perceptual features that distinguish animals from objects and big objects from small objects? To address this question, we used a texture synthesis algorithm to create a class of stimuli-texforms-which preserve some mid-level texture and form information from objects while rendering them unrecognizable. We found that unrecognizable texforms were sufficient to elicit the large-scale organizations of object-selective cortex along the entire ventral pathway. Further, the structure in the neural patterns elicited by texforms was well predicted by curvature features and by intermediate layers of a deep convolutional neural network, supporting the mid-level nature of the representations. These results provide clear evidence that a substantial portion of ventral stream organization can be accounted for by coarse texture and form information without requiring explicit recognition of intact objects.
Keywords: deep neural networks; fMRI; mid-level features; object recognition; ventral stream organization.
Conflict of interest statement
The authors declare no conflict of interest.
Figures






References
-
- DiCarlo JJ, Cox DD. Untangling invariant object recognition. Trends Cogn Sci. 2007;11:333–341. - PubMed
-
- Mishkin M, Ungerleider LG, Macko KA. Object vision and spatial vision: Two cortical pathways. Trends Neurosci. 1983;6:414–417.
-
- Cohen L, et al. The visual word form area: Spatial and temporal characterization of an initial stage of reading in normal subjects and posterior split-brain patients. Brain. 2000;123:291–307. - PubMed
-
- Downing PE, Jiang Y, Shuman M, Kanwisher N. A cortical area selective for visual processing of the human body. Science. 2001;293:2470–2473. - PubMed
-
- Epstein R, Kanwisher N. A cortical representation of the local visual environment. Nature. 1998;392:598–601. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources