. 2013 Nov;11(11):e1001710.

doi: 10.1371/journal.pbio.1001710. Epub 2013 Nov 12.

Constructing noise-invariant representations of sound in the auditory pathway

Neil C Rabinowitz¹, Ben D B Willmore, Andrew J King, Jan W H Schnupp

Affiliations

Affiliation

¹ Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom ; Center for Neural Science, New York University, New York, New York, United States of America.

PMID: 24265596
PMCID: PMC3825667
DOI: 10.1371/journal.pbio.1001710

Constructing noise-invariant representations of sound in the auditory pathway

Neil C Rabinowitz et al. PLoS Biol. 2013 Nov.

. 2013 Nov;11(11):e1001710.

doi: 10.1371/journal.pbio.1001710. Epub 2013 Nov 12.

Authors

Neil C Rabinowitz¹, Ben D B Willmore, Andrew J King, Jan W H Schnupp

Affiliation

¹ Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom ; Center for Neural Science, New York University, New York, New York, United States of America.

PMID: 24265596
PMCID: PMC3825667
DOI: 10.1371/journal.pbio.1001710

Abstract

Identifying behaviorally relevant sounds in the presence of background noise is one of the most important and poorly understood challenges faced by the auditory system. An elegant solution to this problem would be for the auditory system to represent sounds in a noise-invariant fashion. Since a major effect of background noise is to alter the statistics of the sounds reaching the ear, noise-invariant representations could be promoted by neurons adapting to stimulus statistics. Here we investigated the extent of neuronal adaptation to the mean and contrast of auditory stimulation as one ascends the auditory pathway. We measured these forms of adaptation by presenting complex synthetic and natural sounds, recording neuronal responses in the inferior colliculus and primary fields of the auditory cortex of anaesthetized ferrets, and comparing these responses with a sophisticated model of the auditory nerve. We find that the strength of both forms of adaptation increases as one ascends the auditory pathway. To investigate whether this adaptation to stimulus statistics contributes to the construction of noise-invariant sound representations, we also presented complex, natural sounds embedded in stationary noise, and used a decoding approach to assess the noise tolerance of the neuronal population code. We find that the code for complex sounds in the periphery is affected more by the addition of noise than the cortical code. We also find that noise tolerance is correlated with adaptation to stimulus statistics, so that populations that show the strongest adaptation to stimulus statistics are also the most noise-tolerant. This suggests that the increase in adaptation to sound statistics from auditory nerve to midbrain to cortex is an important stage in the construction of noise-invariant sound representations in the higher auditory brain.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Figure 1. Single unit responses to clean and noisy sounds.**
Left column, the spectrogram of a segment of speech under four noise conditions, with the noise level increasing (i.e., the SNR decreasing) from top to bottom. Second to fourth columns, example rasters showing the responses of sAN responses and of responses recorded in the IC and AC, over 50 stimulus presentations. Gray lines, average PSTH.

**Figure 2. Along the auditory pathway, neurons' response distributions become increasingly independent of the level of background noise.**
(A) Average distribution of normalized firing rates by location/SNR. For each unit, , where is the firing rate. This shows that the average response distribution within the population changes less with noise in higher auditory centers. (B) Kullback–Leibler divergence between individual units' normalized firing-rate distributions evoked from clean sounds and evoked from noisy sounds. Smaller values indicate that firing rate distributions were similar. This shows that individual neurons' response distributions change less with noise in higher auditory centers. (C) Statistical independence of stimulus-conditioned response distributions to the background noise level (see Materials and Methods for details of metric). Lower values indicate that response distributions were highly dependent on the stimulus SNR; a value of 1 indicates that response distributions were completely independent of the stimulus SNR. Median values of 0.80/0.84/0.88 for sAN/IC/AC (, pairwise rank-sums tests).

formula image — **Figure 2. Along the auditory pathway, neurons' response distributions become increasingly independent of the level of background noise.**
(A) Average distribution of normalized firing rates by location/SNR. For each unit, , where is the firing rate. This shows that the average response distribution within the population changes less with noise in higher auditory centers. (B) Kullback–Leibler divergence between individual units' normalized firing-rate distributions evoked from clean sounds and evoked from noisy sounds. Smaller values indicate that firing rate distributions were similar. This shows that individual neurons' response distributions change less with noise in higher auditory centers. (C) Statistical independence of stimulus-conditioned response distributions to the background noise level (see Materials and Methods for details of metric). Lower values indicate that response distributions were highly dependent on the stimulus SNR; a value of 1 indicates that response distributions were completely independent of the stimulus SNR. Median values of 0.80/0.84/0.88 for sAN/IC/AC (, pairwise rank-sums tests).

**Figure 3. Effect of background noise on incoming signals within neurons' receptive fields.**
(A) Left, sound intensity within a cortical neuron's receptive field for clean (20 dB) and noisy (0 dB) stimulation (see Figure S1B). Right, distribution of the sounds' within-channel intensities. (B) Signals in (A) after adaptation to signal statistics.

**Figure 4. Increasing adaptation to stimulus baseline along the auditory pathway.**
(A) Calculation of BI, a measure of -adaptation, for an example sAN fiber. CDF, cumulative distribution of firing rates. , the 33rd percentile of the CDF under clean sound stimulation —that is, the firing rate with the cumulative probability . BI indicates how little changes with SNR, as . (B) Units' BI in each location.

**Figure 5. Increasing adaptation to stimulus contrast along the auditory pathway.**
(A) Schematic of adaptive-LN model. Top/bottom, DRC stimuli. DRCs are filtered through a STRF, then passed through an output nonlinearity, yielding the firing rate (). Output nonlinearities change with stimulus contrast. Insets, example time series. (B) Example units, nonlinearities during low (blue) and high (red) contrast DRCs. Insets, STRFs. Bottom, distributions of STRF-filtered DRCs under low/high contrast. (C) Nonlinearities in (B), replotted in normalized coordinates. (D) Contrast-dependent changes to the slope of units' nonlinearities. (E) Percentage of residual signal power explained by gain kernel model above an LN model . (F) Log increase in Fisher information in units' encoding of low contrast stimuli, resulting from adaptation to this distribution. Zero, no adaptation. Larger positive values, greater adaptation.

**Figure 6. Decoding the population representations of clean and noisy sounds.**
Schematic of the decoding of neural responses. For each auditory center, a decoder was trained to reconstruct the clean sound spectrogram from the population responses to the clean sounds. We then measured the performance of these decoders when reconstructing spectrograms from the responses to both clean and noisy sounds. Top row, spectrogram of a 2(20 dB SNR) and noisy (10/0/−10 dB SNR) conditions. Left column, decoder training from responses to clean sounds. Population responses are shown as neurograms: each row depicts the time-varying firing rate of a single unit in the population; rows are organized by CF. Right, reconstructed spectrograms () from population responses to noisy sounds, using the same decoders as trained on the left. The similarity between the reconstructed spectrogram and the presented spectrogram is measured by ; likewise, the similarity between and the original, clean spectrogram is measured by . The tendencies for the sAN decoder to produce -like spectrograms, and the IC and AC decoders to produce -like spectrograms, are most visible for the 0 dB and −10 dB conditions.

**Figure 7. Population representations of natural sounds become more noise-tolerant along the auditory pathway.**
(A) Similarity between decoded responses to the clean sounds (), and the clean sounds' spectrograms (). Abscissa, sampled population size. Colored areas, bootstrapped 95% confidence intervals. (B–C) Similarity between decoded responses to the noisy sounds (), and the spectrograms of the presented, noisy sounds (B), or the spectrograms of the original, clean sounds (C). Reconstructions are from the full populations in each location. Red bars are the same in (B) and (C), denoting (i.e., the rightmost points for each curve in A). Error bars, bootstrapped 95% confidence intervals. (D) Index of whether decoded responses were more similar to the presented, noisy sound (negative values), or the original, clean sound (positive values). Similarities denoted by asterisks () are normalized to the maximum score for each location, . Error bars, 95% confidence intervals. Pairwise comparison statistics (bootstrapped): (***), (**), (*). (E) Decoder accuracy in recovering the clean sound's identity from noisy responses, relative to accuracy in doing so from clean responses.

**Figure 8. Higher - and -adaptation explain the increased noise-tolerance of population representations.**
(A) Relationship between decoder performance and BI (measure of -adaptation). Each point represents a subpopulation (one quarter) of the units from each of the sAN/IC/AC populations, subdivided according to units' BI (values in Figure 4B). Abscissa, mean BI in the subpopulation. Ordinate, performance of the subpopulation decoder. Lines, linear fit per SNR. (B) Relationship between decoder performance and CI (measure of -adaptation), similar to (A). Here, each point represents a subpopulation (one quarter) of the units from each of the sAN/IC/AC populations, subdivided according to the amount of units' contrast adaptation (values in Figure 5D). sAN values of were adjusted for low BI (see Figure S6).

See this image and copyright information in PMC

Cited by

Adaptation of the human auditory cortex to changing background noise.
Khalighinejad B, Herrero JL, Mehta AD, Mesgarani N. Khalighinejad B, et al. Nat Commun. 2019 Jun 7;10(1):2509. doi: 10.1038/s41467-019-10611-4. Nat Commun. 2019. PMID: 31175304 Free PMC article.
Contrast gain control occurs independently of both parvalbumin-positive interneuron activity and shunting inhibition in auditory cortex.
Cooke JE, Kahn MC, Mann EO, King AJ, Schnupp JWH, Willmore BDB. Cooke JE, et al. J Neurophysiol. 2020 Apr 1;123(4):1536-1551. doi: 10.1152/jn.00587.2019. Epub 2020 Mar 18. J Neurophysiol. 2020. PMID: 32186432 Free PMC article.
Thresholding of auditory cortical representation by background noise.
Liang F, Bai L, Tao HW, Zhang LI, Xiao Z. Liang F, et al. Front Neural Circuits. 2014 Nov 10;8:133. doi: 10.3389/fncir.2014.00133. eCollection 2014. Front Neural Circuits. 2014. PMID: 25426029 Free PMC article.
Compensating Level-Dependent Frequency Representation in Auditory Cortex by Synaptic Integration of Corticocortical Input.
Happel MF, Ohl FW. Happel MF, et al. PLoS One. 2017 Jan 3;12(1):e0169461. doi: 10.1371/journal.pone.0169461. eCollection 2017. PLoS One. 2017. PMID: 28046062 Free PMC article.
Reduced Neural Responses to Natural Foreground versus Background Sounds in the Auditory Cortex.
Hamersky GR, Shaheen LA, Espejo ML, Wingert JC, David SV. Hamersky GR, et al. J Neurosci. 2025 Mar 5;45(10):e0121242024. doi: 10.1523/JNEUROSCI.0121-24.2024. J Neurosci. 2025. PMID: 39837664

See all "Cited by" articles

References

1. Joris PX, Schreiner CE, Rees A (2004) Neural processing of amplitude-modulated sounds. Physiol Rev 84: 541–577. - PubMed
1. Young ED (2008) Neural representation of spectral and temporal information in speech. Philos Trans R Soc Lond B Biol Sci 363: 923–945. - PMC - PubMed
1. Schreiner CE, Froemke RC, Atencio CA (2011) Spectral processing in auditory cortex. In: Winer JA, Schreiner CE, editors, The auditory cortex, Springer. pp. 275–308.
1. Formisano E, Martino FD, Bonte M, Goebel R (2008) “Who” is saying “what”? brain-based decoding of human voice and speech. Science 322: 970–973. - PubMed
1. Okada K, Rong F, Venezia J, Matchin W, Hsieh IH, et al. (2010) Hierarchical organization of human auditory cortex: evidence from acoustic invariance in the response to intelligible speech. Cereb Cortex 20: 2486–2495. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Constructing noise-invariant representations of sound in the auditory pathway

Affiliation

Constructing noise-invariant representations of sound in the auditory pathway

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources