Which deep learning model can best explain object representations of within-category exemplars?
- PMID: 34520508
- PMCID: PMC8444465
- DOI: 10.1167/jov.21.10.12
Which deep learning model can best explain object representations of within-category exemplars?
Abstract
Deep neural network (DNN) models realize human-equivalent performance in tasks such as object recognition. Recent developments in the field have enabled testing the hierarchical similarity of object representation between the human brain and DNNs. However, the representational geometry of object exemplars within a single category using DNNs is unclear. In this study, we investigate which DNN model has the greatest ability to explain invariant within-category object representations by computing the similarity between representational geometries of visual features extracted at the high-level layers of different DNN models. We also test for the invariability of within-category object representations of these models by identifying object exemplars. Our results show that transfer learning models based on ResNet50 best explained both within-category object representation and object identification. These results suggest that the invariability of object representations in deep learning depends not on deepening the neural network but on building a better transfer learning model.
Figures






References
-
- Ambrose, S. H. (2001). Paleolithic technology and human evolution. Science , 291(5509), 1748–1753. - PubMed
-
- Andrews, T. J., & Ewbank, M. P. (2004). Distinct representations for facial identity and changeable aspects of faces in the human temporal lobe. Neuroimage , 23(3), 905–913. - PubMed
-
- Baylis, G. C., & Driver, J. (2001). Shape-coding in IT cells generalizes over contrast and mirror reversal, but not figure-ground reversal. Nature Neuroscience , 4(9), 937–942. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources