Vowel decoding from single-trial speech-evoked electrophysiological responses: A feature-based machine learning approach
- PMID: 28638700
- PMCID: PMC5474698
- DOI: 10.1002/brb3.665
Abstract
Introduction: Scalp-recorded electrophysiological responses to complex, periodic auditory signals reflect phase-locked activity from neural ensembles within the auditory system. These responses, referred to as frequency-following responses (FFRs), have been widely used to index typical and atypical representation of speech signals in the auditory system. A major limitation of the FFR is its low signal-to-noise ratio at the level of single trials. For this reason, analysis relies on averaging across thousands of trials. The ability to examine the quality of single-trial FFRs will allow investigation of the trial-by-trial dynamics of the FFR, which the averaging approach has made impossible.
Methods: In a novel, data-driven approach, we used machine learning principles to decode information related to the speech signal from single-trial FFRs. FFRs were collected from participants while they listened to two vowels produced by two speakers. Scalp-recorded electrophysiological responses were projected onto a low-dimensional spectral feature space that was derived independently from the same two vowels produced by 40 speakers, none of whom were presented to the participants. A novel supervised machine learning classifier was trained to discriminate vowel tokens on a subset of FFRs from each participant and tested on the remaining subset.
Results: We demonstrate reliable decoding of speech signals at the level of single trials by decomposing the raw FFR into independently derived, information-bearing spectral features of the speech signal.
Conclusions: Taken together, the ability to extract interpretable features at the level of single trials in a data-driven manner opens uncharted possibilities for the noninvasive assessment of human auditory function.
Keywords: EEG; frequency‐following responses; speech decoding; vowels.
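The decoding pipeline outlined in the Methods (derive a spectral feature space from independent speech tokens, project single-trial responses onto it, train on one subset of trials and test on the rest) can be illustrated with a minimal sketch on synthetic data. Everything specific here is an assumption made for illustration and is not taken from the study: the sampling rate, the harmonic amplitude profiles in `PROFILES`, the noise level, and all function names. The abstract does not specify the feature extraction or the classifier, so this sketch substitutes a single discriminant axis (the difference of the two mean vowel spectra) and a nearest-centroid rule.

```python
import math
import random

FS = 2000                                 # sampling rate in Hz (assumed)
N = 200                                   # samples per trial, 100 ms (assumed)
FREQS = [100 * k for k in range(1, 9)]    # harmonics of a 100 Hz F0 (assumed)

# Hypothetical harmonic amplitude profiles standing in for two vowels.
PROFILES = {
    "a": [1.0, 0.3, 0.2, 0.2, 0.3, 0.6, 0.9, 0.5],
    "u": [1.0, 0.8, 0.7, 0.2, 0.1, 0.1, 0.1, 0.1],
}


def synth(vowel, noise_sd, rng, gain=1.0):
    """Synthetic waveform: harmonics with random phases plus Gaussian noise."""
    amps = PROFILES[vowel]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in FREQS]
    wave = []
    for n in range(N):
        clean = sum(a * math.sin(2.0 * math.pi * f * n / FS + p)
                    for a, f, p in zip(amps, FREQS, phases))
        wave.append(gain * clean + rng.gauss(0.0, noise_sd))
    return wave


def unit(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(c * c for c in vec)) or 1.0
    return [c / norm for c in vec]


def spectrum(wave):
    """Unit-normalized DFT magnitudes evaluated only at FREQS."""
    mags = []
    for f in FREQS:
        re = sum(v * math.cos(2.0 * math.pi * f * n / FS) for n, v in enumerate(wave))
        im = sum(v * math.sin(2.0 * math.pi * f * n / FS) for n, v in enumerate(wave))
        mags.append(math.hypot(re, im) / len(wave))
    return unit(mags)


def mean_vec(rows):
    """Element-wise mean of equal-length vectors."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def decode_accuracy(seed=0, n_trials=100, noise_sd=3.0):
    """Train on half of the noisy trials per vowel, test on the other half."""
    rng = random.Random(seed)
    # 1) Feature axis from independent, nearly clean "speech" tokens
    #    (40 hypothetical speakers per vowel, with varying gain).
    speech = {v: [spectrum(synth(v, 0.1, rng, gain=rng.uniform(0.5, 1.5)))
                  for _ in range(40)] for v in PROFILES}
    axis = unit([a - u for a, u in zip(mean_vec(speech["a"]), mean_vec(speech["u"]))])
    # 2) Project noisy single-trial responses onto that axis.
    proj = {v: [sum(s * w for s, w in zip(spectrum(synth(v, noise_sd, rng)), axis))
                for _ in range(n_trials)] for v in PROFILES}
    half = n_trials // 2
    # 3) Nearest-centroid classifier fitted on the training half only.
    centroid = {v: sum(proj[v][:half]) / half for v in PROFILES}
    correct = total = 0
    for label in PROFILES:
        for p in proj[label][half:]:
            guess = min(centroid, key=lambda v: abs(p - centroid[v]))
            correct += guess == label
            total += 1
    return correct / total


if __name__ == "__main__":
    print(f"held-out single-trial accuracy: {decode_accuracy():.2f}")
```

The feature axis is computed only from the synthetic "speech" tokens, never from the noisy trials, which mirrors the independence between the feature space and the recorded responses described in the Methods. Raising `noise_sd` lowers the single-trial signal-to-noise ratio and pushes the held-out accuracy toward chance.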
