Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;128(1):291-9.
doi: 10.1121/1.3436565.

Prior listening in rooms improves speech intelligibility

Affiliations

Prior listening in rooms improves speech intelligibility

Eugene Brandewie et al. J Acoust Soc Am. 2010 Jul.

Abstract

Although results from previous studies have demonstrated that the acoustic effects of a single reflection are perceptually suppressed after repeated exposure to a particular configuration of source and reflection, the extent to which this dynamic echo suppression might generalize to speech understanding in room environments with multiple reflections and reverberation is largely unknown. Here speech intelligibility was measured using the coordinate response measure corpus both with and without prior listening exposure to a reverberant room environment, which was simulated using virtual auditory space techniques. Prior room listening exposure was manipulated by presenting either a two-sentence carrier phrase that preceded the target speech, or no carrier phrase within the room environment. Results from 14 listeners indicate that with prior room exposure, masked speech reception thresholds were on average 2.7 dB lower than thresholds without exposure, an improvement in intelligibility of over 18 percentage points on average. This effect, which is shown to be absent in anechoic space and greatly reduced under monaural listening conditions, demonstrates that prior binaural exposure to reverberant rooms can improve speech intelligibility, perhaps due to a process of perceptual adaptation to the acoustics of the listening room.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Room dimensions and the position of the listener, the speech target, and the noise masker for all simulated rooms
Figure 2
Figure 2
Examples of the target and masker waveforms (left-ear only) for the NC and SC conditions in a moderately reverberant room (R2).
Figure 3
Figure 3
Proportion of correct responses, P(C), as a function of signal-to-noise ratio for a single listener (LIS) in Exp. I. Data from the moderately reverberant room (R2) for both the NC and SC conditions are shown, along with logistic function fits for each condition (see text for details). Each data point is based on responses from 30 trials. Speech reception threshold estimates and their 95% confidence intervals are indicated for each curve at the midpoint between chance and perfect performance.
Figure 4
Figure 4
Speech reception thresholds from Exp. I for the NC and SC conditions in room R2. 95% confidence intervals are displayed for each threshold estimate. Listeners are rank-ordered from left to right by effect size (NC threshold–SC threshold). Threshold values based on function fits to the data pooled across all listeners are shown on the right.
Figure 5
Figure 5
Speech reception thresholds from Exp. II for the NC and SC conditions in anechoic space (R0). 95% confidence intervals are displayed for each threshold estimate. The left to right listener order is the same as shown in Fig. 4, except that listener LIT did not participate in Exp. II. Threshold values based on function fits to the data pooled across all listeners are shown on the right.
Figure 6
Figure 6
Speech reception thresholds from Exp. III for the NC and SC conditions in R2 under monaural listening conditions (left-ear only). 95% confidence intervals are displayed for each threshold estimate. The left to right listener order is the same as shown in Fig. 4. Threshold values based on function fits to the data pooled across all listeners are shown on the right.

References

    1. Allen, J. B., and Berkley, D. A. (1979). “Image method for efficiently simulating small-room acoustics,” J. Acoust. Soc. Am. JASMAN 65, 943–950.10.1121/1.382599 - DOI
    1. Berg, B. G., and Green, D. M. (1990). “Spectral weights in profile listening,” J. Acoust. Soc. Am. JASMAN 88, 758–766.10.1121/1.399725 - DOI - PubMed
    1. Blauert, J., and Col, J. P. (1992). “Irregularities in the precedence effect,” in Auditory Physiology and Perception, edited by Cazals Y., Demany L., and Horner K. (Pergamon, Oxford: ), pp. 531–538.
    1. Bolia, R. S., Nelson, W. T., Ericson, M. A., and Simpson, B. D. (2000). “A speech corpus for multitalker communications research,” J. Acoust. Soc. Am. JASMAN 107, 1065–1066.10.1121/1.428288 - DOI - PubMed
    1. Carney, L. H., and Yin, T. C. T. (1989). “Responses of low-frequency cells in the inferior colliculus to interaural time differences of clicks: Excitatory and inhibitory components,” J. Neurophysiol. JONEA4 62, 144–161. - PubMed

Publication types

MeSH terms