From convolutional neural networks to models of higher-level cognition (and back again)
- PMID: 33754368
- PMCID: PMC9292363
- DOI: 10.1111/nyas.14593
From convolutional neural networks to models of higher-level cognition (and back again)
Abstract
The remarkable successes of convolutional neural networks (CNNs) in modern computer vision are by now well known, and they are increasingly being explored as computational models of the human visual system. In this paper, we ask whether CNNs might also provide a basis for modeling higher-level cognition, focusing on the core phenomena of similarity and categorization. The most important advance comes from the ability of CNNs to learn high-dimensional representations of complex naturalistic images, substantially extending the scope of traditional cognitive models that were previously only evaluated with simple artificial stimuli. In all cases, the most successful combinations arise when CNN representations are used with cognitive models that have the capacity to transform them to better fit human behavior. One consequence of these insights is a toolkit for the integration of cognitively motivated constraints back into CNN training paradigms in computer vision and machine learning, and we review cases where this leads to improved performance. A second consequence is a roadmap for how CNNs and cognitive models can be more fully integrated in the future, allowing for flexible end-to-end algorithms that can learn representations from data while still retaining the structured behavior characteristic of human cognition.
Keywords: categorization; cognitive modeling; convolutional neural networks; similarity; vision.
© 2021 New York Academy of Sciences.
Conflict of interest statement
The authors declare no competing interests.
Figures
References
-
- Krizhevsky, A. , Sutskever I. & Hinton G.E.. 2012. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 1097–1105.
-
- Baxter, J. 2000. A model of inductive bias learning. J. Artif. Intell. Res. 12: 149–198.
-
- Russakovsky, O. et al. 2015. ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115: 211–252.
-
- Duta, I.C. , Liu L., Zhu F. & Shao L.. 2020. Pyramidal convolution: rethinking convolutional neural networks for visual recognition. arXiv preprint arXiv:2006.11538.
-
- Lin, T.‐Y. et al. 2014. Microsoft COCO: common objects in context. In European Conference on Computer Vision 740–755.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
