EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location
- PMID: 30836345
- DOI: 10.1088/1741-2552/ab0cf1
EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location
Abstract
Objective: It has been shown that attentional selection in a simple dichotic listening paradigm can be decoded offline by reconstructing the stimulus envelope from single-trial neural response data. Here, we test the efficacy of this approach in an environment with non-stationary talkers. We then look beyond the envelope reconstructions themselves and consider whether incorporating the decoder values-which reflect the weightings applied to the multichannel EEG data at different time lags and scalp locations when reconstructing the stimulus envelope-can improve decoding performance.
Approach: High-density EEG was recorded as subjects attended to one of two talkers. The two speech streams were filtered using HRTFs, and the talkers were alternated between the left and right locations at varying intervals to simulate a dynamic environment. We trained spatio-temporal decoders mapping from EEG data to the attended and unattended stimulus envelopes. We then decoded auditory attention by (1) using the attended decoder to reconstruct the envelope and (2) exploiting the fact that decoder weightings themselves contain signatures of attention, resulting in consistent patterns across subjects that can be classified.
Main results: The previously established decoding approach was found to be effective even with non-stationary talkers. Signatures of attentional selection and attended direction were found in the spatio-temporal structure of the decoders and were consistent across subjects. The inclusion of decoder weights into the decoding algorithm resulted in significantly improved decoding accuracies (from 61.07% to 65.31% for 4 s windows). An attempt was made to include alpha power lateralization as another feature to improve decoding, although this was unsuccessful at the single-trial level.
Significance: This work suggests that the spatial-temporal decoder weights can be utilised to improve decoding. More generally, looking beyond envelope reconstruction and incorporating other signatures of attention is an avenue that should be explored to improve selective auditory attention decoding.
Similar articles
-
Noise-robust cortical tracking of attended speech in real-world acoustic scenes.Neuroimage. 2017 Aug 1;156:435-444. doi: 10.1016/j.neuroimage.2017.04.026. Epub 2017 Apr 13. Neuroimage. 2017. PMID: 28412441
-
Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG.Neuroimage. 2020 Jan 15;205:116283. doi: 10.1016/j.neuroimage.2019.116283. Epub 2019 Oct 17. Neuroimage. 2020. PMID: 31629828
-
EEG-based auditory attention detection: boundary conditions for background noise and speaker positions.J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12. J Neural Eng. 2018. PMID: 30207293
-
Neural Encoding of Attended Continuous Speech under Different Types of Interference.J Cogn Neurosci. 2018 Nov;30(11):1606-1619. doi: 10.1162/jocn_a_01303. Epub 2018 Jul 13. J Cogn Neurosci. 2018. PMID: 30004849 Review.
-
Decoding the dynamic tumor microenvironment.Sci Adv. 2021 Jun 4;7(23):eabi5904. doi: 10.1126/sciadv.abi5904. Print 2021 Jun. Sci Adv. 2021. PMID: 34088677 Free PMC article. Review.
Cited by
-
Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment.J Neurosci. 2022 Jan 26;42(4):682-691. doi: 10.1523/JNEUROSCI.1455-20.2021. Epub 2021 Dec 10. J Neurosci. 2022. PMID: 34893546 Free PMC article.
-
Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions.Front Hum Neurosci. 2020 Oct 7;14:557534. doi: 10.3389/fnhum.2020.557534. eCollection 2020. Front Hum Neurosci. 2020. PMID: 33132874 Free PMC article.
-
A Speech-Level-Based Segmented Model to Decode the Dynamic Auditory Attention States in the Competing Speaker Scenes.Front Neurosci. 2022 Feb 10;15:760611. doi: 10.3389/fnins.2021.760611. eCollection 2021. Front Neurosci. 2022. PMID: 35221885 Free PMC article.
-
Selective Auditory Attention Detection Using Combined Transformer and Convolutional Graph Neural Networks.Bioengineering (Basel). 2024 Nov 30;11(12):1216. doi: 10.3390/bioengineering11121216. Bioengineering (Basel). 2024. PMID: 39768034 Free PMC article.
-
Neural Speech Tracking during Selective Attention: A Spatially Realistic Audiovisual Study.eNeuro. 2025 Jun 24;12(6):ENEURO.0132-24.2025. doi: 10.1523/ENEURO.0132-24.2025. Print 2025 Jun. eNeuro. 2025. PMID: 40456616 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical