Brain mechanisms for invariant visual recognition and learning
- PMID: 24925242
- DOI: 10.1016/0376-6357(94)90062-0
Abstract
Mechanisms by which the brain could perform invariant recognition of objects including faces are addressed neurophysiologically, and then a computational model of how this could occur is described. Some neurons that respond primarily to faces are found in the macaque cortex in the anterior part of the superior temporal sulcus (in which region neurons are especially likely to be tuned to facial expression, and to face movement involved in gesture). They are also found more ventrally in the TE areas which form the inferior temporal gyrus. Here the neurons are more likely to have responses related to the identity of faces. These areas project on to the amygdala and orbitofrontal cortex, in which face-selective neurons are also found. Quantitative studies of the responses of the neurons that respond differently to the faces of different individuals show that information about the identity of the individual is represented by the responses of a population of neurons, that is, ensemble encoding is used. The rather distributed encoding (within the class faces) about identity in these sensory cortical regions has the advantages of maximising the information in the representation useful for discrimination between stimuli, generalisation, and graceful degradation. In contrast, the more sparse representations in structures such as the hippocampus may be useful to maximise the number of different memories stored. There is evidence that the responses of some of these neurons are altered by experience so that new stimuli become incorporated in the network, in only a few seconds of experience with a new stimulus. It is shown that the representation that is built in temporal cortical areas shows considerable invariance for size, contrast, spatial frequency and translation. Thus the representation is in a form which is particularly useful for storage and as an output from the visual system. 
It is also shown that one of the representations which is built is view-invariant, which is suitable for recognition and as an input to associative memory. Another is viewer-centered, which is appropriate for conveying information about gesture. It is shown that these computational processes operate rapidly, in that in a backward masking paradigm, 20-40 ms of neuronal activity in a cortical area is sufficient to support face recognition. In a clinical application of these findings, it is shown that humans with ventral frontal lobe damage have in some cases impairments in face and voice expression identification. These impairments are correlated with and may contribute to the problems some of these patients have in emotional and social behaviour. To help provide an understanding of how the invariant recognition described could be performed by the brain, a neuronal network model of processing in the ventral visual system is described. The model uses a multistage feed-forward architecture, and is able to learn invariant representations of objects including faces by use of a Hebbian synaptic modification rule which incorporates a short memory trace (0.5 s) of preceding activity to enable the network to learn the properties of objects which are spatio-temporally invariant over this time scale.
Copyright © 1994. Published by Elsevier B.V.
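The Hebbian rule with a short memory trace mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the equations, so the form below is an assumption: the trace is taken to be an exponentially decaying average of recent output activity, and the weight change is the product of that trace and the current input. The names `update_trace`, `trace_hebbian_step`, `eta`, and `alpha` are hypothetical, as are the winner-take-all competition and the weight normalisation, and no attempt is made to map the 0.5 s time scale onto a particular value of `eta`.

```python
import numpy as np

def update_trace(trace, y, eta):
    """Assumed trace form: blend current activity y with the previous trace.

    eta (hypothetical) sets how much preceding activity is retained.
    """
    return (1.0 - eta) * y + eta * trace

def trace_hebbian_step(w, x, trace, alpha=0.1, eta=0.8):
    """One learning step for a single layer (illustrative assumptions throughout).

    w     : (n_outputs, n_inputs) weight matrix
    x     : (n_inputs,) current input, e.g. one view of an object
    trace : (n_outputs,) memory trace of preceding output activity
    """
    activation = w @ x
    # Assumed competition: only the most active output neuron fires.
    y = np.zeros_like(activation)
    y[np.argmax(activation)] = 1.0
    trace = update_trace(trace, y, eta)
    # Hebbian update gated by the trace instead of the instantaneous output,
    # so inputs that follow each other in time strengthen the same neurons.
    w = w + alpha * np.outer(trace, x)
    # Assumed normalisation to keep each neuron's weight vector bounded.
    w = w / np.linalg.norm(w, axis=1, keepdims=True)
    return w, trace

def train(sequences, n_outputs, alpha=0.1, eta=0.8, seed=0):
    """Train on sequences of views; each sequence shows one object over time."""
    rng = np.random.default_rng(seed)
    n_inputs = sequences[0].shape[1]
    w = rng.random((n_outputs, n_inputs))
    w = w / np.linalg.norm(w, axis=1, keepdims=True)
    for seq in sequences:
        trace = np.zeros(n_outputs)  # reset the trace between objects
        for x in seq:
            w, trace = trace_hebbian_step(w, x, trace, alpha, eta)
    return w
```

Because the trace carries activity forward across successive inputs, views of one object seen close together in time tend to drive weight changes onto the same output neurons, which is the intuition behind learning properties that are invariant over the trace's time scale.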