Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids
- PMID: 28993231
- PMCID: PMC6433278
- DOI: 10.1016/j.neuroimage.2017.10.011
Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids
Abstract
For people who cannot communicate due to severe paralysis or involuntary movements, technology that decodes intended speech from the brain may offer an alternative means of communication. If decoding proves to be feasible, intracranial Brain-Computer Interface systems can be developed which are designed to translate decoded speech into computer generated speech or to instructions for controlling assistive devices. Recent advances suggest that such decoding may be feasible from sensorimotor cortex, but it is not clear how this challenge can be approached best. One approach is to identify and discriminate elements of spoken language, such as phonemes. We investigated feasibility of decoding four spoken phonemes from the sensorimotor face area, using electrocorticographic signals obtained with high-density electrode grids. Several decoding algorithms including spatiotemporal matched filters, spatial matched filters and support vector machines were compared. Phonemes could be classified correctly at a level of over 75% with spatiotemporal matched filters. Support Vector machine analysis reached a similar level, but spatial matched filters yielded significantly lower scores. The most informative electrodes were clustered along the central sulcus. Highest scores were achieved from time windows centered around voice onset time, but a 500 ms window before onset time could also be classified significantly. The results suggest that phoneme production involves a sequence of robust and reproducible activity patterns on the cortical surface. Importantly, decoding requires inclusion of temporal information to capture the rapid shifts of robust patterns associated with articulator muscle group contraction during production of a phoneme. The high classification scores are likely to be enabled by the use of high density grids, and by the use of discrete phonemes. Implications for use in Brain-Computer Interfaces are discussed.
Keywords: Brain-computer interface; Decoding; ECoG; Language; Phonemes.
Copyright © 2017 Elsevier Inc. All rights reserved.
Figures






Similar articles
-
Decoding Single and Paired Phonemes Using 7T Functional MRI.Brain Topogr. 2024 Sep;37(5):731-747. doi: 10.1007/s10548-024-01034-6. Epub 2024 Jan 23. Brain Topogr. 2024. PMID: 38261272 Free PMC article.
-
Decoding of articulatory gestures during word production using speech motor and premotor cortical activity.Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:5339-42. doi: 10.1109/EMBC.2015.7319597. Annu Int Conf IEEE Eng Med Biol Soc. 2015. PMID: 26737497
-
Repeated Vowel Production Affects Features of Neural Activity in Sensorimotor Cortex.Brain Topogr. 2019 Jan;32(1):97-110. doi: 10.1007/s10548-018-0673-4. Epub 2018 Sep 20. Brain Topogr. 2019. PMID: 30238309 Free PMC article.
-
Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.Neurotherapeutics. 2022 Jan;19(1):263-273. doi: 10.1007/s13311-022-01190-2. Epub 2022 Jan 31. Neurotherapeutics. 2022. PMID: 35099768 Free PMC article. Review.
-
Brain-to-speech decoding will require linguistic and pragmatic data.J Neural Eng. 2018 Dec;15(6):063001. doi: 10.1088/1741-2552/aae466. Epub 2018 Sep 26. J Neural Eng. 2018. PMID: 30256217 Review.
Cited by
-
Human motor cortex relies on sparse and action-specific activation during laughing, smiling and speech production.Commun Biol. 2019 Mar 26;2:118. doi: 10.1038/s42003-019-0360-3. eCollection 2019. Commun Biol. 2019. PMID: 30937400 Free PMC article.
-
Decoding four hand gestures with a single bipolar pair of electrocorticography electrodes.J Neural Eng. 2021 Oct 22;18(5):10.1088/1741-2552/ac2c9f. doi: 10.1088/1741-2552/ac2c9f. J Neural Eng. 2021. PMID: 34607318 Free PMC article.
-
Decoding Imagined and Spoken Phrases From Non-invasive Neural (MEG) Signals.Front Neurosci. 2020 Apr 7;14:290. doi: 10.3389/fnins.2020.00290. eCollection 2020. Front Neurosci. 2020. PMID: 32317917 Free PMC article.
-
The influence of prior pronunciations on sensorimotor cortex activity patterns during vowel production.J Neural Eng. 2018 Dec;15(6):066025. doi: 10.1088/1741-2552/aae329. Epub 2018 Sep 21. J Neural Eng. 2018. PMID: 30238924 Free PMC article.
-
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices.Front Neurosci. 2019 Nov 22;13:1267. doi: 10.3389/fnins.2019.01267. eCollection 2019. Front Neurosci. 2019. PMID: 31824257 Free PMC article.
References
-
- Blakely T, Miller KJ, Rao RPN, Holmes MD, Ojemann JG. Localization and classification of phonemes using high spatial resolution electrocorticography (ECoG) grids. Conf Proc Annu Int Conf IEEE Eng Med Biol Soc IEEE Eng Med Biol Soc Annu Conf. 2008;2008:4964–4967. doi: 10.1109/IEMBS.2008.4650328. - DOI - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources