. 2012 Dec;13(6):835-52.

doi: 10.1007/s10162-012-0343-2. Epub 2012 Aug 8.

Level-dependent changes in perception of speech envelope cues

Judy R Dubno¹, Jayne B Ahlstrom, Xin Wang, Amy R Horwitz

Affiliations

Affiliation

¹ Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, SC 29425-5500, USA. dubnojr@musc.edu

PMID: 22872414
PMCID: PMC3505593
DOI: 10.1007/s10162-012-0343-2

Level-dependent changes in perception of speech envelope cues

Judy R Dubno et al. J Assoc Res Otolaryngol. 2012 Dec.

. 2012 Dec;13(6):835-52.

doi: 10.1007/s10162-012-0343-2. Epub 2012 Aug 8.

Authors

Judy R Dubno¹, Jayne B Ahlstrom, Xin Wang, Amy R Horwitz

Affiliation

¹ Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, SC 29425-5500, USA. dubnojr@musc.edu

PMID: 22872414
PMCID: PMC3505593
DOI: 10.1007/s10162-012-0343-2

Abstract

Level-dependent changes in temporal envelope fluctuations in speech and related changes in speech recognition may reveal effects of basilar-membrane nonlinearities. As a result of compression in the basilar-membrane response, the "effective" magnitude of envelope fluctuations may be reduced as speech level increases from lower level (more linear) to mid-level (more compressive) regions. With further increases to a more linear region, speech envelope fluctuations may become more pronounced. To assess these effects, recognition of consonants and key words in sentences was measured as a function of speech level for younger adults with normal hearing. Consonant-vowel syllables and sentences were spectrally degraded using "noise vocoder" processing to maximize perceptual effects of changes to the speech envelope. Broadband noise at a fixed signal-to-noise ratio maintained constant audibility as speech level increased. Results revealed significant increases in scores and envelope-dependent feature transmission from 45 to 60 dB SPL and decreasing scores and feature transmission from 60 to 85 dB SPL. This quadratic pattern, with speech recognition maximized at mid levels and poorer at lower and higher levels, is consistent with a role of cochlear nonlinearities in perception of speech envelope cues.

PubMed Disclaimer

Figures

**FIG. 1**
One-third-octave band spectra of vocoded CV syllables (*blue lines*), sentences (*red lines*), and background noise (*thick black lines*) for five overall speech levels (45, 50, 60, 70, and 85 dB SPL). Mean quiet thresholds are also shown in each panel (*triangles*).

**FIG. 2**
Mean (*thick lines*) and individual (*thin lines*) recognition scores plotted as a function of speech level, for consonants (*blue*, *top*) and key words in sentences (*red*, *middle*). Mean scores for the two speech materials are also displayed in the *bottom panel*. For clarity, some data points are offset along the *abscissa*. *Error bars* indicate ±1 standard deviation.

**FIG. 3**
Key word recognition scores plotted against consonant recognition scores, for five speech levels (*top to bottom panels*). Pearson correlation coefficients and linear regressions are included in each panel.

**FIG. 4**
*Top:* mean information transmitted plotted as a function of speech level for the three acoustic-phonetic features of voicing, manner of articulation, and place of articulation. *Bottom*: same as *top panel*, but for three sub-categories of manners of articulation (plosive, nasality, and frication).

**FIG. 5**
Slope (percent per dB) at the highest speech level (85 dB SPL) plotted against slope at the lowest speech level (45 dB SPL), for recognition of consonants (*top*) and key words in sentences (*bottom*). Slopes for each speech material were computed from the polynomial fit applied to the score-level function for each subject. Pearson correlation coefficients and linear regression functions are included in each panel.

**FIG. 6**
Slope (percent per dB) calculated from speech scores at the highest level (85 dB SPL) plotted against the range of scores for recognition of consonants (*top*) and key words in sentences (*bottom*). Pearson correlation coefficients and linear regression functions are included in each panel.

**FIG. 7**
*Top:* DPOAE levels plotted as a function of L ₂ for f ₂ of 1.0 kHz. DPOAE input–output function slopes were computed from DPOAE levels recorded for L ₂ between 40 and 65 dB SPL (*red lines*). *Bottom:* slopes of DPOAE input–output functions for f ₂ at 1.0 kHz (from the *top panel*) plotted against DPOAE summed levels. The Pearson correlation coefficient and linear regression function are also included.

**FIG. 8**
Slopes of DPOAE input–output functions for f ₂ of 2.0 kHz plotted against the range of key word recognition scores. The Pearson correlation coefficient and linear regression function are also included.

**FIG. 9**
*Top:* Slopes of DPOAE input–output functions for an f ₂ of 1.0 kHz plotted against the change in consonant recognition scores with speech level increasing from 60 to 70 dB SPL. *Bottom:* slopes of DPOAE input–output functions for an f ₂ of 2.0 kHz plotted against the change in key word recognition scores with speech level increasing from 60 to 70 dB SPL. Pearson correlation coefficients and linear regression functions are included in each panel.

See this image and copyright information in PMC

Cited by

Compression and amplification algorithms in hearing aids impair the selectivity of neural responses to speech.
Armstrong AG, Lam CC, Sabesan S, Lesica NA. Armstrong AG, et al. Nat Biomed Eng. 2022 Jun;6(6):717-730. doi: 10.1038/s41551-021-00707-y. Epub 2021 May 3. Nat Biomed Eng. 2022. PMID: 33941898 Free PMC article.
Adaptation to Noise in Human Speech Recognition Unrelated to the Medial Olivocochlear Reflex.
Marrufo-Pérez MI, Eustaquio-Martín A, Lopez-Poveda EA. Marrufo-Pérez MI, et al. J Neurosci. 2018 Apr 25;38(17):4138-4145. doi: 10.1523/JNEUROSCI.0024-18.2018. Epub 2018 Mar 28. J Neurosci. 2018. PMID: 29593051 Free PMC article.
Recognition of spectrally shaped speech in speech-modulated noise: Effects of age, spectral shape, speech level, and vocoding.
Fogerty D, Ahlstrom JB, Dubno JR. Fogerty D, et al. JASA Express Lett. 2023 Apr 1;3(4):044402. doi: 10.1121/10.0017772. JASA Express Lett. 2023. PMID: 37096892 Free PMC article.
Adaptation to Noise in Human Speech Recognition Depends on Noise-Level Statistics and Fast Dynamic-Range Compression.
Marrufo-Pérez MI, Sturla-Carreto DDP, Eustaquio-Martín A, Lopez-Poveda EA. Marrufo-Pérez MI, et al. J Neurosci. 2020 Aug 19;40(34):6613-6623. doi: 10.1523/JNEUROSCI.0469-20.2020. Epub 2020 Jul 17. J Neurosci. 2020. PMID: 32680938 Free PMC article.

References

1. Alves-Pinto A, Lopez-Poveda EA. Detection of high-frequency spectral notches as a function of level. J Acoust Soc Am. 2005;118:2458–2469. doi: 10.1121/1.2032067. - DOI - PubMed
1. American National Standards Institute (2004) Specification for audiometers. ANSI S3.6-2004, American National Standards Institute, New York
1. Guidelines for manual pure-tone threshold audiometry. MD: American Speech-Language-Hearing Association; 2005.
1. Başkent D. Speech recognition in normal hearing and sensorineural hearing loss as a function of the number of spectral channels. J Acoust Soc Am. 2006;120:2908–2925. doi: 10.1121/1.2354017. - DOI - PubMed
1. Bess FH, Josey AF, Humes LE. Performance-intensity functions in cochlear and eighth nerve disorders. Am J Otol. 1979;1:27–31. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Level-dependent changes in perception of speech envelope cues

Affiliation

Level-dependent changes in perception of speech envelope cues

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials

Miscellaneous