Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios

Gavin M Bidelman^{1

2

3}, Jessica Yoo²

Affiliations

¹ Institute for Intelligent Systems, University of Memphis, Memphis, TN, United States.
² School of Communication Sciences and Disorders, University of Memphis, Memphis, TN, United States.
³ Department of Anatomy and Neurobiology, University of Tennessee Health Sciences Center, Memphis, TN, United States.

PMID: 32973610
PMCID: PMC7461890
DOI: 10.3389/fpsyg.2020.01927

Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios

Gavin M Bidelman et al. Front Psychol. 2020.

. 2020 Aug 18:11:1927.

doi: 10.3389/fpsyg.2020.01927. eCollection 2020.

Authors

Gavin M Bidelman^{1

2

3}, Jessica Yoo²

Affiliations

¹ Institute for Intelligent Systems, University of Memphis, Memphis, TN, United States.
² School of Communication Sciences and Disorders, University of Memphis, Memphis, TN, United States.
³ Department of Anatomy and Neurobiology, University of Tennessee Health Sciences Center, Memphis, TN, United States.

PMID: 32973610
PMCID: PMC7461890
DOI: 10.3389/fpsyg.2020.01927

Abstract

Studies suggest that long-term music experience enhances the brain's ability to segregate speech from noise. Musicians' "speech-in-noise (SIN) benefit" is based largely on perception from simple figure-ground tasks rather than competitive, multi-talker scenarios that offer realistic spatial cues for segregation and engage binaural processing. We aimed to investigate whether musicians show perceptual advantages in cocktail party speech segregation in a competitive, multi-talker environment. We used the coordinate response measure (CRM) paradigm to measure speech recognition and localization performance in musicians vs. non-musicians in a simulated 3D cocktail party environment conducted in an anechoic chamber. Speech was delivered through a 16-channel speaker array distributed around the horizontal soundfield surrounding the listener. Participants recalled the color, number, and perceived location of target callsign sentences. We manipulated task difficulty by varying the number of additional maskers presented at other spatial locations in the horizontal soundfield (0-1-2-3-4-6-8 multi-talkers). Musicians obtained faster and better speech recognition amidst up to around eight simultaneous talkers and showed less noise-related decline in performance with increasing interferers than their non-musician peers. Correlations revealed associations between listeners' years of musical training and CRM recognition and working memory. However, better working memory correlated with better speech streaming. Basic (QuickSIN) but not more complex (speech streaming) SIN processing was still predicted by music training after controlling for working memory. Our findings confirm a relationship between musicianship and naturalistic cocktail party speech streaming but also suggest that cognitive factors at least partially drive musicians' SIN advantage.

Keywords: acoustic scene analysis; experience-dependent plasticity; musical training; speech-in-noise perception; stream segregation.

PubMed Disclaimer

Figures

**FIGURE 1**
Cocktail party streaming task. **(A)** Participants were seated in the center of a 16-channel speaker array within an anechoic chamber. Speaker heights were positioned at ear level (∼130 cm) during the task with a radial distance of 160 cm to the center of the head and speaker-to-speaker distance of ∼20°. **(B)** Example stimulus presentation (three- and six-talker conditions). Participants were asked to recall the color, number, and perceived location of target callsign sentences from the coordinate response measure (CRM) corpus (Bolia et al., 2000). Target location was varied randomly from trial to trial and occurred simultaneous with between zero and eight concurrent masking talkers.

**FIGURE 2**
Cocktail party listening is superior in musicians. **(A)** Speech recognition declines with increasing masker counts in both groups, but musicians show less performance decrement up to eight interfering talkers (*inset*). Dotted line = chance performance. **(B)** Musicians show faster (∼200–400 ms) speech recognition speeds than non-musicians. **(C)** Both groups localized correctly identified targets within two speakers (<40° error) with better localization in musicians. Error bars = ± 1 s.e.m.

**FIGURE 3**
Cognitive skills are superior in musicians. **(A)** Raven’s fluid IQ and **(B)** auditory working memory are enhanced in musicians. **(C)** Musicians also obtain ∼1 dB lower reception thresholds on the QuickSIN test, consistent with the notion of a musician advantage in speech-in-noise (SIN) perception. No group differences were observed in sustained attention (data not shown). Error bars = ± 1 s.e.m. *p < 0.05, ****p < 0.0001.

**FIGURE 4**
Correlation results. **(A)** Formal music training predicts musicians’ perceptual–cognitive advantages in working memory (WM) and speech streaming at the cocktail party. More extensive music training is associated with better auditory WM and shallower masker-related declines in speech streaming (see Figure 2A, inset). **(B)** Speech streaming is also related to WM; higher WM capacity predicts better cocktail party performance.

See this image and copyright information in PMC

Cited by

Attention-Driven Modulation of Auditory Cortex Activity during Selective Listening in a Multispeaker Setting.
Puschmann S, Regev M, Fakhar K, Zatorre RJ, Thiel CM. Puschmann S, et al. J Neurosci. 2024 Apr 10;44(15):e1157232023. doi: 10.1523/JNEUROSCI.1157-23.2023. J Neurosci. 2024. PMID: 38388426 Free PMC article.
Transcranial Direct Current Stimulation Combined With Listening to Preferred Music Alters Cortical Speech Processing in Older Adults.
Bidelman GM, Chow R, Noly-Gandon A, Ryan JD, Bell KL, Rizzi R, Alain C. Bidelman GM, et al. Front Neurosci. 2022 Jul 6;16:884130. doi: 10.3389/fnins.2022.884130. eCollection 2022. Front Neurosci. 2022. PMID: 35873829 Free PMC article.
Auditory working memory mechanisms mediating the relationship between musicianship and auditory stream segregation.
Liu M, Arseneau-Bruneau I, Farrés Franch M, Latorre ME, Samuels J, Issa E, Payumo A, Rahman N, Loureiro N, Leung TCM, Nave KM, von Handorf KM, Hoddinott JD, Coffey EBJ, Grahn J, Zatorre RJ. Liu M, et al. Front Psychol. 2025 Mar 28;16:1538511. doi: 10.3389/fpsyg.2025.1538511. eCollection 2025. Front Psychol. 2025. PMID: 40226491 Free PMC article.
Are musical activities associated with enhanced speech perception in noise in adults? A systematic review and meta-analysis.
Maillard E, Joyal M, Murray MM, Tremblay P. Maillard E, et al. Curr Res Neurobiol. 2023 Mar 24;4:100083. doi: 10.1016/j.crneur.2023.100083. eCollection 2023. Curr Res Neurobiol. 2023. PMID: 37397808 Free PMC article. Review.
The Listener Effect in Multitalker Speech Segregation and Talker Identification.
Lutfi RA, Rodriguez B, Lee J. Lutfi RA, et al. Trends Hear. 2021 Jan-Dec;25:23312165211051886. doi: 10.1177/23312165211051886. Trends Hear. 2021. PMID: 34693853 Free PMC article.

See all "Cited by" articles

References

1. Alain C. (2007). Breaking the wave: effects of attention and learning on concurrent sound perception. Hear. Res. 229 225–236. 10.1016/j.heares.2007.01.011 - DOI - PubMed
1. Anaya E. M., Pisoni D. P., Kronenberger W. G. (2016). Long-term musical experience and auditory and visual perceptual abilities under adverse conditions. J. Acoust. Soc. Am. 140 2074–2081. 10.1121/1.4962628 - DOI - PMC - PubMed
1. Başkent D., Fuller C. D., Galvin J. J., Schepel L., Gaudrain E., Free R. H. (2018). Musician effect on perception of spectro-temporally degraded speech, vocal emotion, and music in young adolescents. J. Acoust. Soc. Am. 143 EL311–EL316. - PubMed
1. Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Statist. Soc. Ser. B 57 289–300. 10.1111/j.2517-6161.1995.tb02031.x - DOI
1. Bialystok E., Depape A. M. (2009). Musical expertise, bilingualism, and executive functioning. J. Exper. Psychol. Hum. Percept. Perform. 35 565–574. 10.1037/a0012735 - DOI - PubMed

Grants and funding

R01 DC016267/DC/NIDCD NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios

Affiliations

Musicians Show Improved Speech Segregation in Competitive, Multi-Talker Cocktail Party Scenarios

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources