Comparative Study

. 2020 Mar 13;10(1):4638.

doi: 10.1038/s41598-020-61409-0.

Emergence of Visual Center-Periphery Spatial Organization in Deep Convolutional Neural Networks

Yalda Mohsenzadeh^{1

2

3}, Caitlin Mullin⁴, Benjamin Lahner⁵, Aude Oliva⁵

Affiliations

¹ Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, USA. ymohsenz@uwo.ca.
² Department of Computer Science, The University of Western Ontario, London, ON, Canada. ymohsenz@uwo.ca.
³ The Brain and Mind Institute, The University of Western Ontario, London, ON, Canada. ymohsenz@uwo.ca.
⁴ Department of Psychology, Center for Vision Research, York University, Toronto, ON, Canada.
⁵ Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, USA.

PMID: 32170209
PMCID: PMC7070097
DOI: 10.1038/s41598-020-61409-0

Comparative Study

Emergence of Visual Center-Periphery Spatial Organization in Deep Convolutional Neural Networks

Yalda Mohsenzadeh et al. Sci Rep. 2020.

. 2020 Mar 13;10(1):4638.

doi: 10.1038/s41598-020-61409-0.

Authors

Yalda Mohsenzadeh^{1

2

3}, Caitlin Mullin⁴, Benjamin Lahner⁵, Aude Oliva⁵

Affiliations

¹ Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, USA. ymohsenz@uwo.ca.
² Department of Computer Science, The University of Western Ontario, London, ON, Canada. ymohsenz@uwo.ca.
³ The Brain and Mind Institute, The University of Western Ontario, London, ON, Canada. ymohsenz@uwo.ca.
⁴ Department of Psychology, Center for Vision Research, York University, Toronto, ON, Canada.
⁵ Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, USA.

PMID: 32170209
PMCID: PMC7070097
DOI: 10.1038/s41598-020-61409-0

Abstract

Research at the intersection of computer vision and neuroscience has revealed hierarchical correspondence between layers of deep convolutional neural networks (DCNNs) and cascade of regions along human ventral visual cortex. Recently, studies have uncovered emergence of human interpretable concepts within DCNNs layers trained to identify visual objects and scenes. Here, we asked whether an artificial neural network (with convolutional structure) trained for visual categorization would demonstrate spatial correspondences with human brain regions showing central/peripheral biases. Using representational similarity analysis, we compared activations of convolutional layers of a DCNN trained for object and scene categorization with neural representations in human brain visual regions. Results reveal a brain-like topographical organization in the layers of the DCNN, such that activations of layer-units with central-bias were associated with brain regions with foveal tendencies (e.g. fusiform gyrus), and activations of layer-units with selectivity for image backgrounds were associated with cortical regions showing peripheral preference (e.g. parahippocampal cortex). The emergence of a categorical topographical correspondence between DCNNs and brain regions suggests these models are a good approximation of the perceptual representation generated by biological neural networks.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Hierarchical correspondences between layers of DCNN and brain regions of interest along ventral visual pathway. (A) For each image, the activation of units in each of the 5 convolutional layers are vectorized. RDM representation for each layer is created by computing the pairwise distance of these image specific vector patterns (1-Pearson Corr). Then fMRI RDM representations in EVC, Fusiform, IT and PHC areas are compared with the RDM representations of each convolutional layer of Hybrid-CNN by computing Spearman’s correlations. (B) Neural representations along ventral visual pathway. RDM matrices, and 2D multidimensional scaling visualization of stimuli depicted for early visual cortex (EVC), fusiform gyrus (Fusiform), inferior temporal cortex (IT) and parahippocampal cortex (PHC). (C) The correlation values for brain ROIs and layers of DCNN are depicted with bar plots. The error bars indicate the standard error of the mean and the stars above each bar indicates significant correlation above zero (N = 15, P < 0.05, Bonferroni-corrected). The noise ceiling for each brain area is reported on the right side of the panel. The pictures used in this figure are not examples of the stimulus set due to copyright.

**Figure 2**
Creating topographical correlation maps. We extract the 3D activation patterns from the network convolutional layers. The first 2 Dimensions have a spatial relation with the image space (width and height). At each (x, y) position in feature maps, we extract a pattern vector with the length equivalent to the depth and construct the RDM matrix from the neural network activity patterns at each (x, y) location. Comparison of these RDM matrices with a brain ROI RDM results in a 2D correlation map which we then up-sample it to the image size (topographical map). The pictures used in this figure are not examples of the stimulus set due to copyright.

**Figure 3**
Topographical correspondence between convolutional layers of DCNNs and human ventral visual regions. For each brain-model mapping (EVC, Fusiform, IT, PHC), the first five maps show the correlational topographical maps between each convolutional layer and the brain ROI; the second five maps show the corresponding significance maps (two-sided sign permutation tests, cluster defining threshold P < 0.01, and corrected significance level P < 0.05). The topographical correlation maps in this figure are computed following the method depicted in Fig. 2. For detailed description of RDM computations and correlations please see the Method section.

See this image and copyright information in PMC

Cited by

Perceptual Expertise and Attention: An Exploration using Deep Neural Networks.
Das S, Mangun GR, Ding M. Das S, et al. bioRxiv [Preprint]. 2024 Oct 16:2024.10.15.617743. doi: 10.1101/2024.10.15.617743. bioRxiv. 2024. PMID: 39464001 Free PMC article. Preprint.
Acute Angiotensin II Receptor Blockade Facilitates Parahippocampal Processing During Memory Encoding in High-Trait-Anxious Individuals.
Shkreli L, Thoroddsen T, Kobelt M, Martens MAG, Browning M, Harmer CJ, Cowen P, Reinecke A. Shkreli L, et al. Biol Psychiatry Glob Open Sci. 2023 Dec 25;4(2):100286. doi: 10.1016/j.bpsgos.2023.100286. eCollection 2024 Mar. Biol Psychiatry Glob Open Sci. 2023. PMID: 38323154 Free PMC article.
Will We Ever Have Conscious Machines?
Krauss P, Maier A. Krauss P, et al. Front Comput Neurosci. 2020 Dec 22;14:556544. doi: 10.3389/fncom.2020.556544. eCollection 2020. Front Comput Neurosci. 2020. PMID: 33414712 Free PMC article.
Evaluating large language models in theory of mind tasks.
Kosinski M. Kosinski M. Proc Natl Acad Sci U S A. 2024 Nov 5;121(45):e2405460121. doi: 10.1073/pnas.2405460121. Epub 2024 Oct 29. Proc Natl Acad Sci U S A. 2024. PMID: 39471222 Free PMC article.
Reconstructing feedback representations in the ventral visual pathway with a generative adversarial autoencoder.
Al-Tahan H, Mohsenzadeh Y. Al-Tahan H, et al. PLoS Comput Biol. 2021 Mar 24;17(3):e1008775. doi: 10.1371/journal.pcbi.1008775. eCollection 2021 Mar. PLoS Comput Biol. 2021. PMID: 33760819 Free PMC article.

See all "Cited by" articles

References

1. Grill-Spector K, Weiner KS. The functional architecture of the ventral temporal cortex and its role in categorization. Nature Reviews Neuroscience. 2014;15:536–548. doi: 10.1038/nrn3747. - DOI - PMC - PubMed
1. Kanwisher N, McDermott J, Chun MM. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception. The Journal of Neuroscience. 1997;17:4302–4311. doi: 10.1523/JNEUROSCI.17-11-04302.1997. - DOI - PMC - PubMed
1. Epstein R, Kanwisher N. A cortical representation of the local visual environment. Nature. 1998;392:598–601. doi: 10.1038/33402. - DOI - PubMed
1. Epstein R, Harris A, Stanley D, Kanwisher N. The Parahippocampal Place Area: Recognition, Navigation, or Encoding? Neuron. 1999;23:115–125. doi: 10.1016/S0896-6273(00)80758-8. - DOI - PubMed
1. Konkle T, Caramazza A. Tripartite Organization of the Ventral Stream by Animacy and Object Size. Journal of Neuroscience. 2013;33:10235–10242. doi: 10.1523/JNEUROSCI.0983-13.2013. - DOI - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Emergence of Visual Center-Periphery Spatial Organization in Deep Convolutional Neural Networks

Affiliations

Emergence of Visual Center-Periphery Spatial Organization in Deep Convolutional Neural Networks

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources