Decoding EEG Brain Activity for Multi-Modal Natural Language Processing
- PMID: 34326723
- PMCID: PMC8314009
- DOI: 10.3389/fnhum.2021.659410
Abstract
Until recently, human behavioral data from reading has mainly interested researchers seeking to understand human cognition. However, these human language processing signals can also benefit machine learning-based natural language processing tasks. The use of EEG brain activity for this purpose remains largely unexplored. In this paper, we present the first large-scale study that systematically analyzes the potential of EEG brain activity data for improving natural language processing tasks, with a special focus on which features of the signal are most beneficial. We present a multi-modal machine learning architecture that learns jointly from textual input and from EEG features. We find that filtering the EEG signals into frequency bands is more beneficial than using the broadband signal. Moreover, for a range of word embedding types, EEG data improves binary and ternary sentiment classification and outperforms multiple baselines. For more complex tasks such as relation detection, only the contextualized BERT embeddings outperform the baselines in our experiments, which calls for further research. Finally, EEG data proves particularly promising when limited training data is available.
Keywords: EEG; brain activity; frequency bands; machine learning; multi-modal learning; natural language processing; neural network; physiological data.
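The two ideas the abstract combines, per-band EEG features and joint learning from text and EEG, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the band edges are conventional approximate ranges that the abstract does not specify, `band_powers` computes mean spectral power with a naive DFT as a simple stand-in for band-pass filtering, and `fuse` uses plain concatenation rather than the architecture described in the paper.

```python
import cmath
import math

# Assumed, conventional band edges in Hz; the abstract does not state them.
BANDS = {
    "theta": (4.0, 8.0),
    "alpha": (8.0, 13.0),
    "beta": (13.0, 30.0),
    "gamma": (30.0, 50.0),
}


def band_powers(signal, sampling_rate, bands=BANDS):
    """Mean spectral power per band for one EEG channel (naive DFT).

    Hypothetical helper: a stand-in for band-pass filtering, not the
    preprocessing used in the study.
    """
    n = len(signal)
    collected = {name: [] for name in bands}
    # A real-valued signal only needs the bins up to the Nyquist frequency.
    for k in range(1, n // 2 + 1):
        freq = k * sampling_rate / n
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        power = abs(coeff) ** 2 / n
        for name, (low, high) in bands.items():
            if low <= freq < high:
                collected[name].append(power)
    return {
        name: (sum(values) / len(values) if values else 0.0)
        for name, values in collected.items()
    }


def fuse(text_embedding, eeg_features):
    """Concatenate text and EEG features into one input vector.

    Hypothetical helper: plain concatenation, assumed here only to show
    a single model input built from both modalities.
    """
    return list(text_embedding) + list(eeg_features)


# Usage: one second of a synthetic 10 Hz oscillation sampled at 128 Hz.
sampling_rate = 128
signal = [
    math.sin(2 * math.pi * 10 * t / sampling_rate)
    for t in range(sampling_rate)
]
powers = band_powers(signal, sampling_rate)
word_vector = [0.1, 0.2, 0.3]  # placeholder word embedding
model_input = fuse(word_vector, [powers[name] for name in BANDS])
```

For the synthetic 10 Hz signal, nearly all power falls into the assumed alpha band, which is the kind of band-specific information a broadband feature would average away. In practice a dedicated signal-processing library would replace the naive DFT, which scales quadratically with signal length.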
Copyright © 2021 Hollenstein, Renggli, Glaus, Barrett, Troendle, Langer and Zhang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.