Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(10):e25506.
doi: 10.1371/journal.pone.0025506. Epub 2011 Oct 13.

An efficient coding hypothesis links sparsity and selectivity of neural responses

Affiliations

An efficient coding hypothesis links sparsity and selectivity of neural responses

Florian Blättler et al. PLoS One. 2011.

Abstract

To what extent are sensory responses in the brain compatible with first-order principles? The efficient coding hypothesis projects that neurons use as few spikes as possible to faithfully represent natural stimuli. However, many sparsely firing neurons in higher brain areas seem to violate this hypothesis in that they respond more to familiar stimuli than to nonfamiliar stimuli. We reconcile this discrepancy by showing that efficient sensory responses give rise to stimulus selectivity that depends on the stimulus-independent firing threshold and the balance between excitatory and inhibitory inputs. We construct a cost function that enforces minimal firing rates in model neurons by linearly punishing suprathreshold synaptic currents. By contrast, subthreshold currents are punished quadratically, which allows us to optimally reconstruct sensory inputs from elicited responses. We train synaptic currents on many renditions of a particular bird's own song (BOS) and few renditions of conspecific birds' songs (CONs). During training, model neurons develop a response selectivity with complex dependence on the firing threshold. At low thresholds, they fire densely and prefer CON and the reverse BOS (REV) over BOS. However, at high thresholds or when hyperpolarized, they fire sparsely and prefer BOS over REV and over CON. Based on this selectivity reversal, our model suggests that preference for a highly familiar stimulus corresponds to a high-threshold or strong-inhibition regime of an efficient coding strategy. Our findings apply to songbird mirror neurons, and in general, they suggest that the brain may be endowed with simple mechanisms to rapidly change selectivity of neural responses to focus sensory processing on either familiar or nonfamiliar stimuli. In summary, we find support for the efficient coding hypothesis and provide new insights into the interplay between the sparsity and selectivity of neural responses.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Efficient coding of cochlear spectrograms.
(A) Schematic of the auditory pathway in the songbird forebrain. Auditory input to the pallial field L is provided by the thalamic nucleus ovoidalis (Ov). From field L, auditory information is relayed to the caudal mesopallium (CM), and from there to the nucleus interface of the nidopallium (NIf) and to HVC. (B) Network model. At time formula image, the auditory input to the network is a 50-ms window formula image of the sound spectrogram. This input is multiplied by synaptic weights formula image to result in total synaptic currents formula image onto formula image neurons. formula image stands for whitening and dimensionality reduction (principal component analysis), and formula image stands for a sparseness transformation. Neural firing rates are given by rectified synaptic currents. (C) Cost function for a threshold formula image. Subthreshold synaptic currents formula image are punished quadratically and suprathreshold currents are punished linearly. The parameter formula image defining the subthreshold current eliciting minimal cost is set to the expected subthreshold current formula image. Synaptic currents are reported in units of mean-subtracted standard deviations (z-scores). A threshold formula image implies that suprathreshold currents are depolarizing (positive), whereas subthreshold currents are hyperpolarizing (negative). formula image.
Figure 2
Figure 2. Spectral temporal receptive fields.
(A)Spectral temporal receptive fields (STRFs) of formula image neurons, arranged by nearest-neighbor similarity (circular boundary conditions). Neurons tend to be either temporally tuned (vertical stripes, top right), spectrally tuned (horizontal stripes, middle rows), or display more complex spectro-temporal patterns. Spectral resolution is 172 Hz, offset between subsequent cochlear inputs is 1.5 ms. (B) STRFs obtained with a linear cost on synaptic weight magnitudes. The linear cost forces many synaptic weights to be close to zero (green), leading to low-density STRFs most of which contain a smaller number of excitatory and inhibitory subfields than in A. Interestingly, excitatory and inhibitory subfields tend to be close to each other and aligned horizontally or vertically, similar to observations in field L neurons. The 100 presented STRFs were randomly chosen out of the total 800. formula image.
Figure 3
Figure 3. Receptive fields and neurogram.
(A) Power spectrogram of a bird's own song (BOS). (B) STRFs formula image of eight representative neurons (formula image). The horizontal alignment of STRFs with the spectrogram in A is such that the trailing edges of the STRFs correspond to the respective peak times of synaptic currents. The temporal axis of the STRFs is inverted for better comparison with the BOS spectrogram. (C) Stack plot of synaptic currents of representative neurons in B in response to ten different versions of BOS, vertically aligned to A. (D) Neurogram of synaptic currents in response to the BOS in A. The formula image neurons are sorted according to the peak times of their synaptic currents. Fewer neurons display synaptic current peaks during syllable gaps (blue arrows) than during syllables.
Figure 4
Figure 4. Neurons encode behavioral variability, for example song pitch.
(A) Two receptive fields formed by training a network on all songs produced by a bird on a single day. (B) Spectrograms of a song syllable containing a harmonic stack. The left version has median pitch 1024 Hz, the right version 1138 Hz. (C) Stack plot of synaptic currents in the two neurons elicited by 813 syllable renditions. The stack plots have been sorted identically to reveal that for a given syllable rendition either the left or right neuron exhibits a peak in synaptic current, but not both. Peaks in synaptic currents are computed in intervals indicated by red bars on the bottom. (D) Scatter plot of peak synaptic currents in the two neurons. The distribution is sparse (‘L’-shaped). (E) Median synaptic current in same intervals versus median song pitch of the harmonic stack. The two neurons are detectors of low and high pitch versions of the stack, respectively. Red and blue lines are linear regressions (Neuron 1: formula image, formula image, Neuron 2: formula image, formula image), formula image, formula image.
Figure 5
Figure 5. Actual STRFs and STRFs estimated using reverse correlation.
(A) A selection of twelve STRFs formula image obtained after convergence of the algorithm (formula image). (B) Estimated STRFs (reverse correlation) based on the predicted firing rates formula image. Shown are only estimated STRFs for neurons associated with a correlation coefficient formula image between predicted and actual firing rates of formula image. formula image. formula image.
Figure 6
Figure 6. Probability density of total synaptic currents.
(A) The probability density of total synaptic currents formula image averaged over all neurons has a heavy tail on the positive side. Shown are the densities for BOS (blue), CON (green), and REV (black). Near zero synaptic currents, the curves are approximatively unit Gaussian (red), though their excessive peaks are slightly shifted to the negative side (inset, arrow). The curves cross each other such that large positive synaptic currents are preferentially elicited by the BOS and small positive currents by REV and CON. formula image, formula image. (B) The distributions of synaptic currents for sparse STRFs (Figure 2B) are qualitatively similar to (A). The only noticeable difference is that the distribution for REV is closer to BOS, reflecting a lower selectivity for temporal order. formula image.
Figure 7
Figure 7. Probability densities of mean firing rates.
Mean firing rates in response to (A) a BOS stimulus and (B) a CON stimulus. For each cell we computed the mean firing rate to one stimulus trial. Our simulation data (blue asterisks) are better fit by log-normal densities (red) than by exponential densities (black). Firing rates are plotted in arbitrary units. Fit parameters for log-normal densities were determined by the mean and variance of logarithmic firing rates, and for exponential densities they were determined by the mean firing rates. Thresholds varied from formula image to formula image. Noise amplitude formula image.
Figure 8
Figure 8. Sparsification reduces firing dependences.
(A) The sparseness transformation renders the size distribution of coactive neuron groups (whitening+sparseness) closer to binomial. The probability formula image of the binomial distribution (that a neuron is active per unit time) was estimated in terms of the firing density (the fraction of suprathreshold events over all neurons and training stimuli). Probabilities formula image were nearly identical for whitening and whitening+sparseness when formula image (formula image and formula image during learning). (B) The Kullback–Leibler divergence between size distributions is smaller when comparing the whitening+sparseness model to the binomial model than when comparing the whitening model to the binomial model, for nearly all firing densities tested.
Figure 9
Figure 9. Reconstructing the cochlear spectrograms from firing rates.
(A) Firing-rate of one example neuron in response to BOS for increasing firing thresholds (formula image to formula image). The BOS spectrogram is shown on top. This neuron is tuned to a feature present in introductory notes and responds to it up to thresholds higher than seven. For each threshold, ten different responses are plotted, corresponding to ten different instantiations of synaptic noise. formula image. (B) The reconstruction of a BOS spectrogram (orig., top) using all neurons, based on a firing threshold of minus infinity (whi., 2nd from top) is fairly complete with little information loss (arising from dimensionality reduction). With increasing thresholds (below), more and more syllables are lost in the reconstruction, but the reconstructed spectro-temporal patterns remain clearly recognizable. The arrow points to a down-sweep syllable. (C) Reconstructions of REV (flipped horizontally for comparison with B) are worse than reconstructions of BOS at the same threshold; for example the down-sweep syllable is not well reconstructed (arrow), presumably because zebra finches produce almost no up-sweeps. (D) The fraction of active neurons (averaged over all BOS stimuli) decreases with increasing threshold such that at formula image about 1% of neurons are active on average. This fraction decreases to 0.1% at about formula image. (E) The reconstruction errors averaged over different stimulus ensembles are monotonic functions of the firing threshold. For a given positive threshold, reconstruction errors increase from BOS to CON to REV. formula image threshold-linear neurons.
Figure 10
Figure 10. Smart suppression of electrical noise affecting the recordings.
(A) The STRFs of five neurons that encoded monitor noise. (B) Original BOS spectrogram. The noise is manifest as gray horizontal bands (black arrows) during syllable gaps. (C) Reconstruction of the BOS (Equation 14) from elicited responses in the network. The thresholds of all neurons were set to formula image, with exception of the five neurons in A in which the thresholds were set to formula image. The monitor noise has vanished in the reconstructions, without affecting the birdsong signal.
Figure 11
Figure 11. selectivity for BOS reverses at high firing thresholds.
(A) Example model neuron with reversing BOS-CON selectivity. This neuron's STRF (inlay) codes for an up-sweep from 500 to 800 Hz over 20 ms. The resulting BOS-CON formula image selectivity is negative for low thresholds formula image and turns positive for thresholds formula image. (B) Example cumulative distributions of BOS-REV (blue) and BOS-CON (red) formula image selectivities across formula image neurons for formula image (solid lines) and formula image (dashed lines). For formula image the selectivities are biased towards negative values, whereas for formula image the distributions are biased towards positive values. (C) Bar plot summarizing BOS-REV (blue) and BOS-CON (red) selectivities for a wide range of firing thresholds. The colored bars indicate the median formula image selectivity and the error bars delimit the first and third quartiles. Selectivity reverses at around formula image (REV) and formula image (CON).
Figure 12
Figure 12. Selectivity for CON vs. different artificial stimuli.
Depicted are median selectivities formula image quartiles. PIP = tone-pip stimuli, WN = white noise.
Figure 13
Figure 13. Selectivity in two layers.
Median and quartile selectivities in a network of two layers for various firing thresholds formula image in the first layer and formula image in the second layer. In each simulation, second-layer responses were evaluated using the first-layer threshold applied during training. As can be seen, BOS preference in the second layer is restricted to the high-sparseness regime there (right part of the three subplots).

Similar articles

Cited by

References

    1. Waydo S, Kraskov A, Quian Quiroga R, Fried I, Koch C. Sparse Representation in the Human Medial Temporal Lobe. J Neurosci. 2006;26:10232–10234. - PMC - PubMed
    1. Hochstein S, Ahissar M. View from the top - hierarchies and reverse hierarchies in the visual system. Neuron. 2002;36:791–804(14). - PubMed
    1. Margoliash D. Acoustic parameters underlying the responses of song-specific neurons in the white-crowned sparrow. J Neurosci. 1983;3:1039–1057. - PMC - PubMed
    1. Quiroga RQ, Reddy L, Kreiman G, Koch C, Fried I. Invariant visual representation by single neurons in the human brain. Nature. 2005;435:1102–1107. - PubMed
    1. Olshausen BA, Field DJ. Sparse coding of sensory input. Curr Opin Neurobiol. 2004;14:481–487. - PubMed

Publication types