Do we understand high-level vision?
- PMID: 24552691
- DOI: 10.1016/j.conb.2014.01.016
Do we understand high-level vision?
Abstract
'High-level' vision lacks a single, agreed upon definition, but it might usefully be defined as those stages of visual processing that transition from analyzing local image structure to analyzing structure of the external world that produced those images. Much work in the last several decades has focused on object recognition as a framing problem for the study of high-level visual cortex, and much progress has been made in this direction. This approach presumes that the operational goal of the visual system is to read-out the identity of an object (or objects) in a scene, in spite of variation in the position, size, lighting and the presence of other nearby objects. However, while object recognition as a operational framing of high-level is intuitive appealing, it is by no means the only task that visual cortex might do, and the study of object recognition is beset by challenges in building stimulus sets that adequately sample the infinite space of possible stimuli. Here I review the successes and limitations of this work, and ask whether we should reframe our approaches to understanding high-level vision.
Copyright © 2014. Published by Elsevier Ltd.
Similar articles
-
Deep hierarchies in the primate visual cortex: what can we learn for computer vision?IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1847-71. doi: 10.1109/TPAMI.2012.272. IEEE Trans Pattern Anal Mach Intell. 2013. PMID: 23787340 Review.
-
Acquisition of nonlinear forward optics in generative models: two-stage "downside-up" learning for occluded vision.Neural Netw. 2011 Mar;24(2):148-58. doi: 10.1016/j.neunet.2010.10.004. Epub 2010 Oct 27. Neural Netw. 2011. PMID: 21094592 Review.
-
Recognition invariance obtained by extended and invariant features.Neural Netw. 2004 Jun-Jul;17(5-6):833-48. doi: 10.1016/j.neunet.2004.01.006. Neural Netw. 2004. PMID: 15288901
-
Nonaccidental properties underlie shape recognition in Mammalian and nonmammalian vision.Curr Biol. 2007 Feb 20;17(4):336-40. doi: 10.1016/j.cub.2006.12.025. Epub 2007 Feb 1. Curr Biol. 2007. PMID: 17275301
-
How position dependent is visual object recognition?Trends Cogn Sci. 2008 Mar;12(3):114-22. doi: 10.1016/j.tics.2007.12.006. Epub 2008 Feb 11. Trends Cogn Sci. 2008. PMID: 18262829 Review.
Cited by
-
Object size determines the spatial spread of visual time.Proc Biol Sci. 2016 Jul 27;283(1835):20161024. doi: 10.1098/rspb.2016.1024. Proc Biol Sci. 2016. PMID: 27466452 Free PMC article.
-
The Role of Visual Factors in Dyslexia.J Cogn. 2023 Jun 29;6(1):31. doi: 10.5334/joc.287. eCollection 2023. J Cogn. 2023. PMID: 37397349 Free PMC article.
-
Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015. J Neurosci. 2015. PMID: 26157000 Free PMC article.
-
Strategically managing learning during perceptual decision making.Elife. 2023 Feb 14;12:e64978. doi: 10.7554/eLife.64978. Elife. 2023. PMID: 36786427 Free PMC article.
-
The receptive field is dead. Long live the receptive field?Curr Opin Neurobiol. 2014 Apr;25:ix-xii. doi: 10.1016/j.conb.2014.02.001. Epub 2014 Mar 4. Curr Opin Neurobiol. 2014. PMID: 24618227 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources