Phonetic feature encoding in human superior temporal gyrus

doi:10.1126/science.1245994

. 2014 Feb 28;343(6174):1006-10.

doi: 10.1126/science.1245994. Epub 2014 Jan 30.

Phonetic feature encoding in human superior temporal gyrus

Nima Mesgarani¹, Connie Cheung, Keith Johnson, Edward F Chang

Affiliations

Affiliation

¹ Department of Neurological Surgery, Department of Physiology, and Center for Integrative Neuroscience, University of California, San Francisco, CA 94143, USA.

PMID: 24482117
PMCID: PMC4350233
DOI: 10.1126/science.1245994

Phonetic feature encoding in human superior temporal gyrus

Nima Mesgarani et al. Science. 2014.

. 2014 Feb 28;343(6174):1006-10.

doi: 10.1126/science.1245994. Epub 2014 Jan 30.

Authors

Nima Mesgarani¹, Connie Cheung, Keith Johnson, Edward F Chang

Affiliation

¹ Department of Neurological Surgery, Department of Physiology, and Center for Integrative Neuroscience, University of California, San Francisco, CA 94143, USA.

PMID: 24482117
PMCID: PMC4350233
DOI: 10.1126/science.1245994

Abstract

During speech perception, linguistic elements such as consonants and vowels are extracted from a complex acoustic speech signal. The superior temporal gyrus (STG) participates in high-order auditory processing of speech, but how it encodes phonetic information is poorly understood. We used high-density direct cortical surface recordings in humans while they listened to natural, continuous speech to reveal the STG representation of the entire English phonetic inventory. At single electrodes, we found response selectivity to distinct phonetic features. Encoding of acoustic properties was mediated by a distributed population response. Phonetic features could be directly related to tuning for spectrotemporal acoustic cues, some of which were encoded in a nonlinear fashion or by integration of multiple cues. These findings demonstrate the acoustic-phonetic representation of speech in human STG.

PubMed Disclaimer

Figures

**Fig. 1. Human STG cortical selectivity to speech sounds**
(A) Magnetic resonance image surface reconstruction of one participant's cerebrum. Electrodes (red) are plotted with opacity signifying the t test value when comparing responses to silence and speech (P < 0.01, t test). (B) Example sentence and its acoustic waveform, spectrogram, and phonetic transcription. (C) Neural responses evoked by the sentence at selected electrodes. z score indicates normalized response. (D) Average responses at five example electrodes to all English phonemes and their PSI vectors.

**Fig. 2. Hierarchical clustering of single-electrode and population responses**
(A) PSI vectors of selective electrodes across all participants. Rows correspond to phonemes, and columns correspond to electrodes. (B) Clustering across population PSIs (rows). (C) Clustering across single electrodes (columns). (D) Alternative PSI vectors using rows now corresponding to phonetic features, not phonemes. (E) Weighted average STRFs of main electrode clusters. (F) Average acoustic spectrograms for phonemes in each population cluster. Correlation between average STRFs and average spectrograms: r = 0.67, P < 0.01, t test. (r = 0.50, 0.78, 0.55, 0.86, 0.86, and 0.47 for plosives, fricatives, vowels, and nasals, respectively; P < 0.01, t test).

**Fig. 3. Neural encoding of vowels**
(A) Formant frequencies, F1 and F2, for English vowels (F2-F1, dashed line, first principal component). (B) F1 and F2 partial correlations for each electrode's response (**P < 0.01, t test). Dots (electrodes) are color-coded by their cluster membership. (C) Neural population decoding of fundamental and formant frequencies. Error bars indicate SEM. (D) Multidimensional scaling (MDS) of acoustic and neural space (***P < 0.001, t test).

**Fig. 4. Neural encoding of plosive and fricative phonemes**
(A) Prediction accuracy of plosive and fricative acoustic parameters from neural population responses. Error bars indicate SEM. (B) Response of three example electrodes to all plosive phonemes sorted by VOT. (C) Nonlinearity of VOT-response transformation and (D) distributions of nonlinearity for all plosive-selective electrodes identified in Fig. 2D. Voiced plosive-selective electrodes are shown in pink, and the rest in gray. (E) Partial correlation values between response of electrodes and acoustic parameters shared between plosives and fricatives (**P < 0.01, t test). Dots (electrodes) are color-coded by their cluster grouping from Fig. 2C.

See this image and copyright information in PMC

Comment in

Neuroscience. The neural code that makes us human.
Grodzinsky Y, Nelken I. Grodzinsky Y, et al. Science. 2014 Feb 28;343(6174):978-9. doi: 10.1126/science.1251495. Science. 2014. PMID: 24578570 No abstract available.

Cited by

Against the Epistemological Primacy of the Hardware: The Brain from Inside Out, Turned Upside Down.
Poeppel D, Adolfi F. Poeppel D, et al. eNeuro. 2020 Aug 7;7(4):ENEURO.0215-20.2020. doi: 10.1523/ENEURO.0215-20.2020. Print 2020 Jul/Aug. eNeuro. 2020. PMID: 32769167 Free PMC article.
Neural dynamics of phoneme sequences reveal position-invariant code for content and order.
Gwilliams L, King JR, Marantz A, Poeppel D. Gwilliams L, et al. Nat Commun. 2022 Nov 3;13(1):6606. doi: 10.1038/s41467-022-34326-1. Nat Commun. 2022. PMID: 36329058 Free PMC article.
Australian English listeners' perception of Japanese vowel length reveals underlying phonological knowledge.
Yazawa K, Whang J, Escudero P. Yazawa K, et al. Front Psychol. 2023 Oct 26;14:1122471. doi: 10.3389/fpsyg.2023.1122471. eCollection 2023. Front Psychol. 2023. PMID: 37954175 Free PMC article.
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks.
Bittar A, Garner PN. Bittar A, et al. Front Neurosci. 2024 Sep 25;18:1449181. doi: 10.3389/fnins.2024.1449181. eCollection 2024. Front Neurosci. 2024. PMID: 39385848 Free PMC article.
Phase-encoded fMRI tracks down brainstorms of natural language processing with subsecond precision.
Lei VLC, Leong TI, Leong CT, Liu L, Choi CU, Sereno MI, Li D, Huang RS. Lei VLC, et al. Hum Brain Mapp. 2024 Feb 1;45(2):e26617. doi: 10.1002/hbm.26617. Hum Brain Mapp. 2024. PMID: 38339788 Free PMC article.

See all "Cited by" articles

References

1. Chomsky N, Halle M. The Sound Pattern of English. Harper and Row; New York: 1968.
1. Binder JR, et al. Cereb Cortex. 2000;10:512–528. - PubMed
1. Boatman D, Hall C, Goldstein MH, Lesser R, Gordon B. Cortex. 1997;33:83–98. - PubMed
1. Chang EF, et al. Nat Neurosci. 2010;13:1428–1432. - PMC - PubMed
1. Formisano E, De Martino F, Bonte M, Goebel R. Science. 2008;322:970–973. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

[1] Chomsky N, Halle M. The Sound Pattern of English. Harper and Row; New York: 1968.

[2] Chomsky N, Halle M. The Sound Pattern of English. Harper and Row; New York: 1968.

[3] Binder JR, et al. Cereb Cortex. 2000;10:512–528. - PubMed

[4] Binder JR, et al. Cereb Cortex. 2000;10:512–528. - PubMed

[5] Boatman D, Hall C, Goldstein MH, Lesser R, Gordon B. Cortex. 1997;33:83–98. - PubMed

[6] Boatman D, Hall C, Goldstein MH, Lesser R, Gordon B. Cortex. 1997;33:83–98. - PubMed

[7] Chang EF, et al. Nat Neurosci. 2010;13:1428–1432. - PMC - PubMed

[8] Chang EF, et al. Nat Neurosci. 2010;13:1428–1432. - PMC - PubMed

[9] Formisano E, De Martino F, Bonte M, Goebel R. Science. 2008;322:970–973. - PubMed

[10] Formisano E, De Martino F, Bonte M, Goebel R. Science. 2008;322:970–973. - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Phonetic feature encoding in human superior temporal gyrus

Affiliation

Phonetic feature encoding in human superior temporal gyrus

Authors

Affiliation

Abstract

Figures

Comment in

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources