Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul;84(5):1772-1787.
doi: 10.3758/s13414-022-02486-3. Epub 2022 Apr 26.

Revisiting the target-masker linguistic similarity hypothesis

Affiliations

Revisiting the target-masker linguistic similarity hypothesis

Violet A Brown et al. Atten Percept Psychophys. 2022 Jul.

Abstract

The linguistic similarity hypothesis states that it is more difficult to segregate target and masker speech when they are linguistically similar. For example, recognition of English target speech should be more impaired by the presence of Dutch masking speech than Mandarin masking speech because Dutch and English are more linguistically similar than Mandarin and English. Across four experiments, English target speech was consistently recognized more poorly when presented in English masking speech than in silence, speech-shaped noise, or an unintelligible masker (i.e., Dutch or Mandarin). However, we found no evidence for graded masking effects-Dutch did not impair performance more than Mandarin in any experiment, despite 650 participants being tested. This general pattern was consistent when using both a cross-modal paradigm (in which target speech was lipread and maskers were presented aurally; Experiments 1a and 1b) and an auditory-only paradigm (in which both the targets and maskers were presented aurally; Experiments 2a and 2b). These findings suggest that the linguistic similarity hypothesis should be refined to reflect the existing evidence: There is greater release from masking when the masker language differs from the target speech than when it is the same as the target speech. However, evidence that unintelligible maskers impair speech identification to a greater extent when they are more linguistically similar to the target language remains elusive.

Keywords: Cross-modal masking; Linguistic similarity hypothesis; Masking; Speech identification.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
LTAS across all babble tracks for each linguistic masker shown on a log amplitude scale. Disparities in LTAS before normalization (A), especially at higher frequencies, are reduced after normalization (B). Note that the Mandarin recordings obtained from a previous study on the linguistic similarity hypothesis were recorded at a sampling rate of 16 kHz, which can only accurately represent frequencies up to 8 kHz given Nyquist Sampling (Nyquist, 1928; see also Weisstein, 2022). Both types of English babble and Dutch babble were recorded at 22 kHz.
Figure 2.
Figure 2.
By-participant average lipreading accuracy in each of the six conditions in Experiment 1a, ordered from lowest to highest accuracy. The dot represents the mean accuracy and the shape represents the distribution of responses for each condition. The accompanying table can be found in the Supplementary Materials (Table S2).
Figure 3.
Figure 3.
By-participant mean lipreading accuracy in each of the three conditions in Experiment 1b, ordered from lowest to highest accuracy. The dot represents the mean accuracy and the shape represents the distribution of responses for each condition. The accompanying table can be found in the Supplementary Materials (Table S3).
Figure 4.
Figure 4.
By-participant mean sentence identification accuracy in each of the four conditions in Experiment 2a, ordered from lowest to highest accuracy. The dot represents the mean accuracy and the shape represents the distribution of responses for each condition. The accompanying table can be found in the Supplementary Materials (Table S4).
Figure 5.
Figure 5.
By-participant mean sentence identification accuracy in each of the four conditions in Experiment 2b, ordered from lowest to highest accuracy. The dot represents the mean accuracy and the shape represents the distribution of responses for each condition. The accompanying table can be found in the Supplementary Materials (Table S5).

References

    1. Agus TR, Akeroyd MA, Gatehouse S, & Warden D (2009). Informational masking in young and elderly listeners for speech masked by simultaneous speech and noise. The Journal of the Acoustical Society of America, 126(4), 1926–1940. - PubMed
    1. Anwyl-Irvine AL, Massonnié J, Flitton A, Kirkham N, & Evershed JK (2020). Gorilla in our midst: An online behavioral experiment builder. Behavior Research Methods, 52(1), 388–407. - PMC - PubMed
    1. Barr DJ, Levy R, Scheepers C, & Tily HJ (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3). 10.1016/j.jml.2012.11.001 - DOI - PMC - PubMed
    1. Bates D, Maechler M, Bolker B, Walker S, Christensen R, Singmann H, Dai B, Scheipl F, Grothendieck G, & Green P (2014). Package “lme4” (Version 1.1-15). R foundation for statistical computing, Vienna, 12. https://github.com/lme4/lme4/
    1. Bench J, Kowal A, & Bamford J (1979). The BKB (Bamford-Kowal-Bench) sentence lists for partially-hearing children. British Journal of Audiology, 13(3), 108–112. - PubMed

LinkOut - more resources