Front Neurosci. 2021 Aug 13;15:712667. doi: 10.3389/fnins.2021.712667. eCollection 2021.

Spiking Autoencoders With Temporal Coding

Iulia-Maria Comşa et al. Front Neurosci. 2021.

Abstract

Spiking neural networks with temporal coding schemes process information based on the relative timing of neuronal spikes. In supervised learning tasks, temporal coding allows learning through backpropagation with exact derivatives, and achieves accuracies on par with conventional artificial neural networks. Here we introduce spiking autoencoders with temporal coding and pulses, trained using backpropagation to store and reconstruct images with high fidelity from compact representations. We show that spiking autoencoders with a single layer are able to effectively represent and reconstruct images from the neuromorphically-encoded MNIST and FMNIST datasets. We explore the effect of different spike time target latencies, data noise levels and embedding sizes, as well as the classification performance from the embeddings. The spiking autoencoders achieve results similar to or better than conventional non-spiking autoencoders. We find that inhibition is essential in the functioning of the spiking autoencoders, particularly when the input needs to be memorised for a longer time before the expected output spike times. To reconstruct images with a high target latency, the network learns to accumulate negative evidence and to use the pulses as excitatory triggers for producing the output spikes at the required times. Our results highlight the potential of spiking autoencoders as building blocks for more complex biologically-inspired architectures. We also provide open-source code for the model.
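
As a rough, non-authoritative illustration of the latency coding and reconstruction objective described above, the Python sketch below assumes that pixel intensities in [0, 1] are mapped to spike times (brighter pixels spike earlier) and that the loss compares output spike times with the input spike times delayed by the target latency l; the paper's actual encoding and loss are defined in its open-source code.

    import numpy as np

    def encode_latency(image, t_max=1.0):
        # Latency coding: brighter pixels (values near 1) produce earlier spikes.
        return t_max * (1.0 - image.reshape(-1))

    def reconstruction_loss(output_times, input_times, latency):
        # Squared error between output spike times and the input spike times
        # delayed by the target latency l (illustrative form of the objective).
        return np.mean((output_times - (input_times + latency)) ** 2)

    rng = np.random.default_rng(0)
    image = rng.random((28, 28))                    # stand-in for an MNIST digit
    t_in = encode_latency(image)
    t_out = t_in + 1.0 + 0.05 * rng.standard_normal(t_in.shape)  # fake decoder output
    print(reconstruction_loss(t_out, t_in, latency=1.0))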

Keywords: autoencoders; backpropagation; biologically-inspired artificial intelligence; inhibition; latency coding; spiking networks; temporal coding.

Conflict of interest statement

All authors were employed by Google Research, Switzerland. Parts of the ideas presented here are covered by pending PCT Patent Application No. PCT/US2019/055848 (Temporal Coding in Leaky Spiking Neural Networks), filed by Google in 2019.

Figures

Figure 1
Illustration of membrane potential dynamics for a neuron with θ = 0.5 and τ = 1. The neuron receives input spikes at times ti ∈ {1, 4, 5, 8, 12, 17, 19} with corresponding weights wi ∈ {0.5, 0.3, 0.4, −0.2, −0.3, 1.2, 0.9}, which cause it to spike at tout = 19.39.
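
The following Python sketch reproduces this scenario numerically under the assumption of an alpha-shaped synaptic kernel ε(s) = s·exp(−τs); the paper defines its own neuron model, so the threshold-crossing time found here is only indicative and may not match tout = 19.39 exactly.

    import numpy as np

    theta, tau = 0.5, 1.0
    t_in = np.array([1.0, 4.0, 5.0, 8.0, 12.0, 17.0, 19.0])
    w = np.array([0.5, 0.3, 0.4, -0.2, -0.3, 1.2, 0.9])

    def potential(t):
        # Membrane potential as a weighted sum of assumed alpha kernels over past inputs.
        s = np.maximum(t - t_in, 0.0)
        return np.sum(w * s * np.exp(-tau * s))

    ts = np.arange(0.0, 25.0, 0.001)
    v = np.array([potential(t) for t in ts])
    above = ts[v >= theta]
    print("first threshold crossing:", above[0] if above.size else "no spike")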
Figure 2
Architecture of the spiking autoencoder. The weights and the pulses are trainable.
Figure 3
Reconstruction errors for spiking (“snn”) and non-spiking (“ann”) autoencoders at different levels of noise, for embedding sizes 8, 16, and 32, on the MNIST and FMNIST datasets.
Figure 4
A digit from the MNIST test set reconstructed by a spiking autoencoder with embedding size 32 and target latency l = 1, at different levels of noise.
Figure 5
Visualisation of MNIST embeddings produced by spiking autoencoders with target latency l = 1 at different levels of noise η and embedding sizes h, using the t-distributed stochastic neighbour embedding (t-SNE) technique, with perplexity set to 20. The results are qualitatively similar for different perplexity values. Axis units (not shown) are arbitrary and identical for each plot.
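
A sketch of how such a visualisation can be produced with scikit-learn's TSNE (perplexity 20, as in the figure) is shown below; the embeddings and labels arrays are random placeholders standing in for the autoencoder embeddings and MNIST digit labels.

    import numpy as np
    import matplotlib.pyplot as plt
    from sklearn.manifold import TSNE

    rng = np.random.default_rng(0)
    embeddings = rng.random((1000, 32))      # placeholder for [n_samples, h] embeddings
    labels = rng.integers(0, 10, size=1000)  # placeholder for MNIST digit labels

    coords = TSNE(n_components=2, perplexity=20, init="pca",
                  random_state=0).fit_transform(embeddings)
    plt.scatter(coords[:, 0], coords[:, 1], c=labels, s=4, cmap="tab10")
    plt.axis("off")
    plt.show()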
Figure 6
Interpolating between four items from the MNIST and FMNIST test sets in embedding space. The embeddings are generated by a spiking autoencoder with hidden layer size 32, target latency l = 1, and noise level η = 0. They are then interpolated and, finally, run through the decoder layer to obtain the representation in the original space.
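
A possible implementation of this interpolation is sketched below, assuming the embeddings of the four corner items are given as a [4, h] array; decode is a hypothetical stand-in for the trained decoder layer.

    import numpy as np

    def interpolate_grid(corners, steps=8):
        # corners: [4, h] embeddings of the top-left, top-right, bottom-left and
        # bottom-right items; returns a [steps, steps, h] grid obtained by
        # bilinear interpolation in embedding space.
        tl, tr, bl, br = corners
        grid = np.empty((steps, steps) + tl.shape)
        for i, a in enumerate(np.linspace(0.0, 1.0, steps)):
            left = (1 - a) * tl + a * bl
            right = (1 - a) * tr + a * br
            for j, b in enumerate(np.linspace(0.0, 1.0, steps)):
                grid[i, j] = (1 - b) * left + b * right
        return grid

    # Usage sketch (decode and corner_embeddings are hypothetical):
    # images = [decode(e) for e in interpolate_grid(corner_embeddings).reshape(-1, 32)]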
Figure 7
Accuracy of an SVM classifying embeddings produced by spiking (“snn”) and non-spiking (“ann”) autoencoders at different levels of noise, for embedding sizes 8, 16, and 32, on the MNIST dataset. The baseline is the classification accuracy on the original set.
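
A sketch of such a classifier using scikit-learn is given below; the embedding and label arrays are random placeholders, and the RBF kernel and feature scaling are assumptions rather than the paper's exact SVM settings.

    import numpy as np
    from sklearn.svm import SVC
    from sklearn.preprocessing import StandardScaler
    from sklearn.pipeline import make_pipeline

    rng = np.random.default_rng(0)
    train_emb, train_y = rng.random((5000, 32)), rng.integers(0, 10, 5000)
    test_emb, test_y = rng.random((1000, 32)), rng.integers(0, 10, 1000)

    clf = make_pipeline(StandardScaler(), SVC())  # default RBF kernel
    clf.fit(train_emb, train_y)
    print("test accuracy:", clf.score(test_emb, test_y))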
Figure 8
Spike distributions on the full test set in trained spiking autoencoders with embedding size 32, noise level η = 0, target latencies l = 1 and l = 16. The pulses are shown individually.
Figure 9
Output potentials during the reconstruction of a test example by spiking autoencoders with embedding size 32, noise level η = 0, and target latencies l = 1 and l = 16. The output neuron is chosen such that the target spike time is smaller than l + 0.1 (in other words, it is located in the centre of the image and encodes salient digit information). The figure highlights the initial negative response of the membrane potential, followed by a positive response caused by the pulses.
Figure 10
Weight distributions in spiking autoencoders, for regular neurons and pulses. All models have embedding size h = 32 and noise level η = 0.
Figure 11
Reconstruction loss on the inverted-brightness MNIST dataset for spiking (“snn”) and non-spiking (“ann”) autoencoders. The embedding size is always h = 32. The spiking autoencoder has a target latency of l = 1. The non-spiking networks have either ReLU activation functions in the encoder and sigmoid activation functions in the decoder, or zero-centred Gaussian-like activation functions everywhere.
