Decoding and synthesizing tonal language speech from brain activity
- PMID: 37294753
- PMCID: PMC10256166
- DOI: 10.1126/sciadv.adh0478
Decoding and synthesizing tonal language speech from brain activity
Abstract
Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly synthesizes tonal language speech from intracranial recordings. The network decoded lexical tones and base syllables independently via parallel streams of neural network modules inspired by neuroscience findings. The speech was synthesized by combining tonal syllable labels with nondiscriminant speech neural activity. Compared to commonly used baseline models, our proposed models achieved higher performance with modest training data and computational costs. These findings raise a potential strategy for approaching tonal language speech restoration.
Figures
References
-
- M. Yip, Tone (Cambridge Univ. Press, 2002).
-
- M. S. Dryer, M. Haspelmath, The world atlas of language structures online (2013); http://wals.info [Accessed on 2021 September 21].
-
- J. D. McCawley, What is a tone language? in Tone (Elsevier, 1978), pp. 113–131.
-
- Y.-R. Chao, A System of Tone Letters (Le maître phonétique, 1930).
MeSH terms
LinkOut - more resources
Full Text Sources
