Brain mechanisms for invariant visual recognition and learning

E T Rolls¹

PMID: 24925242
DOI: 10.1016/0376-6357(94)90062-0

E T Rolls. Behav Processes. 1994 Dec;33(1-2):113-38. doi: 10.1016/0376-6357(94)90062-0. Epub 2002 May 31.

Affiliation

¹ Oxford University, Department of Experimental Psychology, South Parks Road, Oxford OX1 3UD, UK.

Abstract

Mechanisms by which the brain could perform invariant recognition of objects including faces are addressed neurophysiologically, and then a computational model of how this could occur is described. Some neurons that respond primarily to faces are found in the macaque cortex in the anterior part of the superior temporal sulcus (in which region neurons are especially likely to be tuned to facial expression, and to face movement involved in gesture). They are also found more ventrally in the TE areas which form the inferior temporal gyrus. Here the neurons are more likely to have responses related to the identity of faces. These areas project on to the amygdala and orbitofrontal cortex, in which face-selective neurons are also found. Quantitative studies of the responses of the neurons that respond differently to the faces of different individuals show that information about the identity of the individual is represented by the responses of a population of neurons, that is, ensemble encoding is used. The rather distributed encoding (within the class faces) about identity in these sensory cortical regions has the advantages of maximising the information in the representation useful for discrimination between stimuli, generalisation, and graceful degradation. In contrast, the more sparse representations in structures such as the hippocampus may be useful to maximise the number of different memories stored. There is evidence that the responses of some of these neurons are altered by experience so that new stimuli become incorporated in the network, in only a few seconds of experience with a new stimulus. It is shown that the representation that is built in temporal cortical areas shows considerable invariance for size, contrast, spatial frequency and translation. Thus the representation is in a form which is particularly useful for storage and as an output from the visual system. 
It is also shown that one of the representations which is built is view-invariant, which is suitable for recognition and as an input to associative memory. Another is viewer-centered, which is appropriate for conveying information about gesture. It is shown that these computational processes operate rapidly, in that in a backward masking paradigm, 20-40 ms of neuronal activity in a cortical area is sufficient to support face recognition. In a clinical application of these findings, it is shown that humans with ventral frontal lobe damage have in some cases impairments in face and voice expression identification. These impairments are correlated with and may contribute to the problems some of these patients have in emotional and social behaviour. To help provide an understanding of how the invariant recognition described could be performed by the brain, a neuronal network model of processing in the ventral visual system is described. The model uses a multistage feed-forward architecture, and is able to learn invariant representations of objects including faces by use of a Hebbian synaptic modification rule which incorporates a short memory trace (0.5 s) of preceding activity to enable the network to learn the properties of objects which are spatio-temporally invariant over this time scale.
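
The Hebbian rule with a memory trace mentioned at the end of the abstract can be illustrated with a small sketch. The abstract does not give the equations, so everything here is an assumption made for illustration: a single linear neuron, an exponentially decaying trace of its own activity, weight normalisation after each update, and the hypothetical names and values `train`, `alpha`, `eta`, `views` and `start`. The value of `eta` is not derived from the 0.5 s figure. The sketch shows three successive "views" of one object, each activating a different input line, and a neuron that initially responds only to the first view.

```python
import math


def normalise(weights):
    """Scale the weight vector to unit length (left unchanged if all zero)."""
    norm = math.sqrt(sum(w * w for w in weights))
    return [w / norm for w in weights] if norm > 0 else list(weights)


def train(sequence, weights, alpha=0.1, eta=0.8, epochs=10):
    """Hebbian learning driven by a decaying trace of postsynaptic activity.

    eta = 0 reduces this to a plain Hebbian rule with no memory of
    preceding activity. All parameter values are illustrative assumptions.
    """
    weights = normalise(weights)
    for _ in range(epochs):
        trace = 0.0  # the trace is reset before each presentation of the object
        for x in sequence:
            # Linear activation of the single output neuron.
            y = sum(w * xi for w, xi in zip(weights, x))
            # Trace: mix current activity with the trace from the previous view.
            trace = (1.0 - eta) * y + eta * trace
            # Hebbian update using the trace in place of the instantaneous rate.
            weights = normalise(
                [w + alpha * trace * xi for w, xi in zip(weights, x)]
            )
    return weights


# Three successive views of one object; input line 3 is never active.
views = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
# The neuron starts out responding only to the first view.
start = [1.0, 0.0, 0.0, 0.0]

plain = train(views, start, eta=0.0)   # no trace: plain Hebbian learning
traced = train(views, start, eta=0.8)  # with a short memory trace

print("plain Hebb :", [round(w, 3) for w in plain])
print("trace rule :", [round(w, 3) for w in traced])
```

In this toy setting the plain Hebbian neuron never acquires weights for the second and third views, because it is silent when they appear. With the trace, activity left over from the first view is still present when the later views arrive, so their input lines are strengthened onto the same neuron, while the input line that is never active stays at zero. This is only meant to convey why a short trace can bind temporally adjacent views of one object; the multistage model described in the abstract is considerably richer.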
