Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response
- PMID: 17804982
- DOI: 10.1097/AUD.0b013e31812f7156
Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response
Abstract
Objectives: This experiment was designed to estimate effects of cochlear nonlinearities on tonal and speech masking for individuals with normal hearing who have a range of quiet thresholds. Physiological and psychophysical evidence indicates that for signals close to the characteristic frequency (CF) of a place on the basilar membrane, the normal growth of response of the basilar membrane is linear at lower stimulus levels and compressed at medium to higher stimulus levels. In contrast, at moderate to high CFs, the basilar membrane responds more linearly to stimuli at frequencies well below the CF regardless of input level. Thus, the hypothesis tested was that masker effectiveness would change as a function of stimulus level consistent with the underlying basilar membrane response. Specifically, with a fixed-level speech signal and a speech-shaped masker that ranges from low to higher levels, the resulting response of the basilar membrane to the masker would be linear at lower levels and compressed at medium to higher levels. This would result in relatively less effective masking at higher masker levels. It was further hypothesized that the transition from linear to compressed responses to both tones and maskers would occur at higher levels for listeners with higher quiet thresholds than for listeners with lower quiet thresholds.
Design: Tonal thresholds and speech recognition in noise were measured as a function of masker level. A 10-msec, 2.0-kHz tone was presented in a lower frequency masker ranging from 40 to 85 dB SPL. Moderate-level speech was presented in interrupted noise at six levels ranging from 47 to 77 dB SPL. To minimize differences in speech audibility that could arise during the "off" periods of the interrupted noise, a low-level steady-state "threshold-matching noise" was also present during measurement of speech recognition. Subjects were 30 adults with normal hearing with a 20-dB range of average quiet thresholds.
Results: Tonal breakpoints (i.e., the levels corresponding to the transitions from linear to nonlinear responses) were significantly correlated with quiet thresholds, whereas slopes measured above the breakpoints were not. Speech recognition in noise was consistent with the hypothesis that the response of the basilar membrane to the masker was linear at lower levels and compressed at medium to higher levels, resulting in less effective masking at higher masker levels. That is, at lower masker levels, as masker level increased, mean observed speech scores declined as predicted using the articulation index, an audibility-based model. With further increases in masker level, mean scores declined less than predicted. Moreover, for subjects with higher quiet thresholds, masker effectiveness remained constant for a wider range of masker levels than for subjects with lower quiet thresholds, consistent with the hypothesis that the transition from linear to compressed responses occurred at higher levels. Finally, significant negative correlations were obtained between individual subjects' tonal and speech measures.
Conclusions: Results from tonal and speech tasks were consistent with basilar membrane nonlinearities and consistent with changes in nonlinearities with minor threshold elevations, providing support for their role in the understanding of speech in noise with increases in noise level.
Similar articles
-
Estimates of basilar-membrane nonlinearity effects on masking of tones and speech.Ear Hear. 2007 Feb;28(1):2-17. doi: 10.1097/AUD.0b013e3180310212. Ear Hear. 2007. PMID: 17204895
-
Auditory brainstem correlates of basilar membrane nonlinearity in humans.Audiol Neurootol. 2009;14(2):88-97. doi: 10.1159/000158537. Epub 2008 Oct 1. Audiol Neurootol. 2009. PMID: 18827479
-
Speech recognition in fluctuating and continuous maskers: effects of hearing loss and presentation level.J Speech Lang Hear Res. 2004 Apr;47(2):245-56. doi: 10.1044/1092-4388(2004/020). J Speech Lang Hear Res. 2004. PMID: 15157127
-
Effects of masker frequency and duration in forward masking: further evidence for the influence of peripheral nonlinearity.Hear Res. 2000 Dec;150(1-2):258-66. doi: 10.1016/s0378-5955(00)00206-9. Hear Res. 2000. PMID: 11077208 Review.
-
Psychoacoustic consequences of compression in the peripheral auditory system.Psychol Rev. 1998 Jan;105(1):108-24. doi: 10.1037/0033-295x.105.1.108. Psychol Rev. 1998. PMID: 9450373 Review.
Cited by
-
What is the role of the medial olivocochlear system in speech-in-noise processing?J Neurophysiol. 2012 Mar;107(5):1301-12. doi: 10.1152/jn.00222.2011. Epub 2011 Dec 7. J Neurophysiol. 2012. PMID: 22157117 Free PMC article.
-
Extended High-Frequency Bandwidth Improves Speech Reception in the Presence of Spatially Separated Masking Speech.Ear Hear. 2015 Sep-Oct;36(5):e214-24. doi: 10.1097/AUD.0000000000000161. Ear Hear. 2015. PMID: 25856543 Free PMC article.
-
Influence of broad auditory tuning on across-frequency integration of speech patterns.J Speech Lang Hear Res. 2010 Oct;53(5):1087-95. doi: 10.1044/1092-4388(2010/09-0185). Epub 2010 Aug 5. J Speech Lang Hear Res. 2010. PMID: 20689025 Free PMC article.
-
Masking release for words in amplitude-modulated noise as a function of modulation rate and task.J Acoust Soc Am. 2009 Jul;126(1):269-80. doi: 10.1121/1.3129506. J Acoust Soc Am. 2009. PMID: 19603883 Free PMC article.
-
Individual differences in behavioral estimates of cochlear nonlinearities.J Assoc Res Otolaryngol. 2012 Feb;13(1):91-108. doi: 10.1007/s10162-011-0291-2. Epub 2011 Sep 22. J Assoc Res Otolaryngol. 2012. PMID: 21938546 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous