Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar;240(3):813-824.
doi: 10.1007/s00221-021-06281-8. Epub 2022 Jan 20.

Can visual capture of sound separate auditory streams?

Affiliations

Can visual capture of sound separate auditory streams?

Chiara Valzolgher et al. Exp Brain Res. 2022 Mar.

Abstract

In noisy contexts, sound discrimination improves when the auditory sources are separated in space. This phenomenon, named Spatial Release from Masking (SRM), arises from the interaction between the auditory information reaching the ear and spatial attention resources. To examine the relative contribution of these two factors, we exploited an audio-visual illusion in a hearing-in-noise task to create conditions in which the initial stimulation to the ears is held constant, while the perceived separation between speech and masker is changed illusorily (visual capture of sound). In two experiments, we asked participants to identify a string of five digits pronounced by a female voice, embedded in either energetic (Experiment 1) or informational (Experiment 2) noise, before reporting the perceived location of the heard digits. Critically, the distance between target digits and masking noise was manipulated both physically (from 22.5 to 75.0 degrees) and illusorily, by pairing target sounds with visual stimuli either at same (audio-visual congruent) or different positions (15 degrees offset, leftward or rightward: audio-visual incongruent). The proportion of correctly reported digits increased with the physical separation between the target and masker, as expected from SRM. However, despite effective visual capture of sounds, performance was not modulated by illusory changes of target sound position. Our results are compatible with a limited role of central factors in the SRM phenomenon, at least in our experimental setting. Moreover, they add to the controversial literature on the limited effects of audio-visual capture in auditory stream separation.

Keywords: Hearing in noise; Sound localization; Spatial release from masking; Visual capture of sound.

PubMed Disclaimer

References

    1. Amenta S, Artesini L, Musola D, Frau GN, Vespignani F, Pavani F (2020) Probing language processing in cochlear implant users with visual word recognition: effects of lexical and orthographic word properties. Lang Cogn Neurosci 36(2):1–12. https://doi.org/10.1080/23273798.2020.18046 - DOI
    1. Arbogast TL, Mason CR, Kidd G (2002) The effect of spatial separation on informational and energetic masking of speech. J Acoust Soc Am 112(5):2086–2098. https://doi.org/10.1121/1.1510141 - DOI - PubMed
    1. Barton K (2018) MuMIn: multi-model inference. R package. Cran-R, 1, 289–290. Retrieved November 2020 from https://cran.r-project.org/web/packages/MuMIn/MuMIn.pdf
    1. Bates D, Mächler M, Bolker BM, Walker SC (2015) Fitting linear mixed-effects models using lme4. J Stat Softw. https://doi.org/10.18637/jss.v067.i01 - DOI
    1. Bertelson P (1999) Ventriloquism: a case of crossmodal perceptual grouping. Adv Psychol 129:347–362. https://doi.org/10.1016/S0166-4115(99)80034-X - DOI

LinkOut - more resources