Sparse-Coding Variational Autoencoders

Victor Geadah¹, Gabriel Barello², Daniel Greenidge³, Adam S Charles⁴, Jonathan W Pillow⁵

Affiliations

¹ Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544, U.S.A. victor.geadah@princeton.edu.
² Institute of Neuroscience, University of Oregon, Eugene, OR 97403, U.S.A. gabrielbarello@gmail.com.
³ Department of Computer Science, Princeton University, Princeton, NJ 08544, U.S.A. daniel@danielgreenidge.com.
⁴ Department of Biomedical Engineering, Department Center for Imaging Science, and Department Kavli Neuroscience Discovery Institute, Baltimore, MD 21218, U.S.A. adamsc@jhu.edu.
⁵ Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, U.S.A. pillow@princeton.edu.

PMID: 39383030
DOI: 10.1162/neco_a_01715

Sparse-Coding Variational Autoencoders

Victor Geadah et al. Neural Comput. 2024.

. 2024 Nov 19;36(12):2571-2601.

doi: 10.1162/neco_a_01715.

Authors

Victor Geadah¹, Gabriel Barello², Daniel Greenidge³, Adam S Charles⁴, Jonathan W Pillow⁵

Affiliations

¹ Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544, U.S.A. victor.geadah@princeton.edu.
² Institute of Neuroscience, University of Oregon, Eugene, OR 97403, U.S.A. gabrielbarello@gmail.com.
³ Department of Computer Science, Princeton University, Princeton, NJ 08544, U.S.A. daniel@danielgreenidge.com.
⁴ Department of Biomedical Engineering, Department Center for Imaging Science, and Department Kavli Neuroscience Discovery Institute, Baltimore, MD 21218, U.S.A. adamsc@jhu.edu.
⁵ Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, U.S.A. pillow@princeton.edu.

PMID: 39383030
DOI: 10.1162/neco_a_01715

Abstract

The sparse coding model posits that the visual system has evolved to efficiently code natural stimuli using a sparse set of features from an overcomplete dictionary. The original sparse coding model suffered from two key limitations; however: (1) computing the neural response to an image patch required minimizing a nonlinear objective function via recurrent dynamics and (2) fitting relied on approximate inference methods that ignored uncertainty. Although subsequent work has developed several methods to overcome these obstacles, we propose a novel solution inspired by the variational autoencoder (VAE) framework. We introduce the sparse coding variational autoencoder (SVAE), which augments the sparse coding model with a probabilistic recognition model parameterized by a deep neural network. This recognition model provides a neurally plausible feedforward implementation for the mapping from image patches to neural activities and enables a principled method for fitting the sparse coding model to data via maximization of the evidence lower bound (ELBO). The SVAE differs from standard VAEs in three key respects: the latent representation is overcomplete (there are more latent dimensions than image pixels), the prior is sparse or heavy-tailed instead of gaussian, and the decoder network is a linear projection instead of a deep network. We fit the SVAE to natural image data under different assumed prior distributions and show that it obtains higher test performance than previous fitting methods. Finally, we examine the response properties of the recognition network and show that it captures important nonlinear properties of neurons in the early visual pathway.

PubMed Disclaimer

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Sparse-Coding Variational Autoencoders

Affiliations

Sparse-Coding Variational Autoencoders

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources