Annu Int Conf IEEE Eng Med Biol Soc. 2025 Jul:2025:1-7.
doi: 10.1109/EMBC58623.2025.11251577.

CA-NeuroSpex: Context-Informed Autoregressive Neuro-Guided Speaker Extraction

Dashanka De Silva et al. Annu Int Conf IEEE Eng Med Biol Soc. 2025 Jul.

Abstract

Neuro-guided target speaker extraction (TSE) leverages neural responses to guide the extraction of attended speech from competing sources, mirroring the brain's ability to navigate multi-speaker environments. However, traditional neuro-guided methods overlook the importance of temporal context. To bridge this gap, we introduce CA-NeuroSpex, a novel context-informed end-to-end TSE framework. It harnesses autoregressive feedback to integrate previously extracted speech as a secondary reference cue via a specialized speech-context encoder. By dynamically fusing this contextual cue with the neural cue, CA-NeuroSpex bolsters extraction performance in a causal decoder setup. Our key contributions include a speech-context encoder for overlapping speech integration, a teacher-forced autoregressive training paradigm, and a gating mechanism for cue fusion. Our results demonstrate the effectiveness of combining dynamic contextual and neural information for robust speaker extraction.
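
The abstract names two mechanisms without giving their form: a gate that fuses the neural cue with the speech-context cue, and autoregressive feedback of previously extracted speech. The sketch below illustrates one common way such pieces fit together; it is not the authors' implementation. The sigmoid gate with a convex mix, the function names `gated_fusion` and `extract_autoregressively`, the `extract_step` callback, and the zero-initialized first context are all assumptions made for illustration.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(neural_cue, context_cue, weights, bias):
    """Mix two equal-length cue vectors with a learned per-element gate.

    Assumed form (not taken from the paper):
        gate_i  = sigmoid(weights[i] . [neural_cue; context_cue] + bias[i])
        fused_i = gate_i * neural_cue[i] + (1 - gate_i) * context_cue[i]
    """
    if len(neural_cue) != len(context_cue):
        raise ValueError("cue vectors must have the same length")
    joint = list(neural_cue) + list(context_cue)
    fused = []
    for i, (n, c) in enumerate(zip(neural_cue, context_cue)):
        pre_activation = sum(w * x for w, x in zip(weights[i], joint)) + bias[i]
        gate = sigmoid(pre_activation)
        fused.append(gate * n + (1.0 - gate) * c)
    return fused


def extract_autoregressively(segments, extract_step, dim):
    """Run a hypothetical per-segment extractor, feeding each output back
    as the context cue for the next segment."""
    context = [0.0] * dim  # assumption: no extracted speech exists before segment 0
    outputs = []
    for segment in segments:
        context = extract_step(segment, context)
        outputs.append(context)
    return outputs
```

With all-zero weights and bias, every gate equals 0.5 and the fused cue is the plain average of the two inputs; a large positive bias pushes the gate toward 1 so the neural cue dominates. In a trained model the weights would be learned, and under teacher forcing the fed-back context would be the ground-truth speech rather than the model's own output.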
