Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 23;11(1):15117.
doi: 10.1038/s41598-021-94662-y.

Signal envelope and speech intelligibility differentially impact auditory motion perception

Affiliations

Signal envelope and speech intelligibility differentially impact auditory motion perception

Michaela Warnecke et al. Sci Rep. .

Abstract

Our acoustic environment contains a plethora of complex sounds that are often in motion. To gauge approaching danger and communicate effectively, listeners need to localize and identify sounds, which includes determining sound motion. This study addresses which acoustic cues impact listeners' ability to determine sound motion. Signal envelope (ENV) cues are implicated in both sound motion tracking and stimulus intelligibility, suggesting that these processes could be competing for sound processing resources. We created auditory chimaera from speech and noise stimuli and varied the number of frequency bands, effectively manipulating speech intelligibility. Normal-hearing adults were presented with stationary or moving chimaeras and reported perceived sound motion and content. Results show that sensitivity to sound motion is not affected by speech intelligibility, but shows a clear difference for original noise and speech stimuli. Further, acoustic chimaera with speech-like ENVs which had intelligible content induced a strong bias in listeners to report sounds as stationary. Increasing stimulus intelligibility systematically increased that bias and removing intelligible content reduced it, suggesting that sound content may be prioritized over sound motion. These findings suggest that sound motion processing in the auditory system can be biased by acoustic parameters related to speech intelligibility.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Speech intelligibility across conditions for each ENV type. Speech intelligibility (mean ± sem) was measured as a function of chimaera and control conditions, with stimulus ENV type indicated by color. As the number of frequency bands increased, intelligibility of sound content increased for chimaera with a speech-like ENV (blue), but decreased for chimaera with a noise-like ENV (red). During control conditions, listener’s performance on sound content intelligibility reached ceiling levels for original speech (ORS), but floor levels for SMN (ORSMN) and both reversed control stimuli (RS/RSMN).
Figure 2
Figure 2
Sensitivity to sound motion across conditions for each ENV type. Sensitivity scores (d’; mean ± sem) are plotted as a function of chimaera and control conditions, with ENV type indicated by color. Higher scores indicate better discriminability. Across chimaera conditions, listeners were significantly more sensitive to sound motion for stimuli with noise-like ENVs (red) compared to speech-like ENVs (blue). In control conditions, original stimuli did not differ from chimaera stimuli, but listeners were significantly less sensitive to sound motion when stimuli were speech (ORS) compared to SMN (ORSMN).
Figure 3
Figure 3
Response bias for sound motion across conditions for each ENV type. Response bias (c; mean ± sem) is plotted as a function of chimaera and control conditions, with ENV type indicated by color. Positive numbers indicate a bias towards judging sounds as moving, while negative numbers indicate a bias towards judging sounds as stationary (grey arrows). Statistical significance (see “Results”) is indicated by stars. Listeners showed a stationary perceptual bias for chimaera stimuli that had a speech-like ENV (blue) and at least six frequency bands. By contrast, chimaera stimuli with a noise-like ENV (red) and 4, 8 or 16 frequency bands induced a moving perceptual bias. Bias differed significantly for control conditions, in which listeners showed no bias for ORSMN, while the strongest stationary bias was induced for ORS. Removing sound content intelligibility, RS, removed the stationary bias that was observed in listeners when the same sound’s content was intelligible, SC16.

References

    1. Bregman AS. Auditory Scene Analysis: The perceptual organization of sound. MIT Press; 1990.
    1. Kidd G, Jr, Arbogast TL, Mason CR, Gallun FJ. The advantage of knowing where to listen. J. Acoust. Soc. Am. 2005;118:3804–3815. doi: 10.1121/1.2109187. - DOI - PubMed
    1. Freyman RL, Helfer KS, McCall DD, Clifton RK. The role of perceived spatial separation in the unmasking of speech. J. Acoust. Soc. Am. 1999;106:3578–3588. doi: 10.1121/1.428211. - DOI - PubMed
    1. Kreitewolf J, Lewald J, Getzmann S. Effect of attention on cortical processing of sound motion: an EEG study. Neuroimage. 2011;54:2340–2349. doi: 10.1016/j.neuroimage.2010.10.031. - DOI - PubMed
    1. Getzmann S, Lewald J. The effect of spatial adaptation on auditory motion processing. Hear. Res. 2011;272:21–29. doi: 10.1016/j.heares.2010.11.005. - DOI - PubMed

Publication types