Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Clinical Trial
. 2013;8(1):e53398.
doi: 10.1371/journal.pone.0053398. Epub 2013 Jan 10.

The tracking of speech envelope in the human cortex

Affiliations
Clinical Trial

The tracking of speech envelope in the human cortex

Jan Kubanek et al. PLoS One. 2013.

Abstract

Humans are highly adept at processing speech. Recently, it has been shown that slow temporal information in speech (i.e., the envelope of speech) is critical for speech comprehension. Furthermore, it has been found that evoked electric potentials in human cortex are correlated with the speech envelope. However, it has been unclear whether this essential linguistic feature is encoded differentially in specific regions, or whether it is represented throughout the auditory system. To answer this question, we recorded neural data with high temporal resolution directly from the cortex while human subjects listened to a spoken story. We found that the gamma activity in human auditory cortex robustly tracks the speech envelope. The effect is so marked that it is observed during a single presentation of the spoken story to each subject. The effect is stronger in regions situated relatively early in the auditory pathway (belt areas) compared to other regions involved in speech processing, including the superior temporal gyrus (STG) and the posterior inferior frontal gyrus (Broca's region). To further distinguish whether speech envelope is encoded in the auditory system as a phonological (speech-related), or instead as a more general acoustic feature, we also probed the auditory system with a melodic stimulus. We found that belt areas track melody envelope weakly, and as the only region considered. Together, our data provide the first direct electrophysiological evidence that the envelope of speech is robustly tracked in non-primary auditory cortex (belt areas in particular), and suggest that the considered higher-order regions (STG and Broca's region) partake in a more abstract linguistic analysis.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. High gamma activity in human auditory cortex tracks the envelope of speech.
Black: Time course of the speech envelope. Green: High gamma activity recorded by a channel positioned in the belt areas in subject C (see Figure 5) while the subject listened to a narrated story. For the visualization purpose of this figure, we graphically scaled the magnitude of the neural signal to the magnitude of the envelope signal. The Spearman correlation between the two signals is formula image.
Figure 2
Figure 2. Tracking of speech envelope for each channel located in the belt areas.
The Spearman correlation coefficient (formula image) between the speech envelope and gamma activity for each channel in belt areas surrounding the auditory core. Filled bars denote the cases of significant correlation (formula image). The star refers to the example channel shown in Figure 1 and Figure 5.
Figure 3
Figure 3. High gamma activity tracks the envelope of speech better than does the raw potential.
Meanformula imageSEM Spearman correlation coefficient (formula image) between each neural signal and speech envelope. The mean is computed across all channels in the belt areas (formula image).
Figure 4
Figure 4. Tracking of speech envelope in three auditory cortical regions.
Meanformula imageSEM Spearman correlation coefficient (formula image) between neural activity and speech envelope in each region of interest. The mean is computed over all channels in each area (formula image, formula image, formula image). Stars denote the significance of the difference in means (two-tailed t-test), *formula image, **formula image.
Figure 5
Figure 5. Neural tracking of speech envelope at each recording site in each subject.
Color hue (see colorbars) gives formula image at each channel for the individual subjects (A–E), and for the subject average (AVG). Individual channels implanted in each subject are shown in green (belt areas), orange (STG), red (Broca's region), or black (other regions). The location of each channel was determined using the Talairach Atlas daemon (see Methods). In subject C, the arrow points to the channel for which we illustrated the tracking effect (Figure 1). STS: superior temporal sulcus; SF: Sylvian fissure; TTS: transverse temporal sulcus (perpendicular to the view plane).
Figure 6
Figure 6. Tracking of speech envelope in early auditory regions compared to all other regions.
Meanformula imageSEM Spearman correlation coefficient (formula image) between neural activity and speech envelope in belt areas (green) and all other regions (gray). The mean is computed over all channels in each case (formula image, formula image).
Figure 7
Figure 7. Neural tracking of speech envelope at each frequency of the sound.
Meanformula imageSEM Spearman correlation coefficient (formula image) between neural activity and speech envelope for each frequency, for each region of interest. The dashed line gives the average spectrum of the speech in logarithmic units.
Figure 8
Figure 8. Tracking of speech-related and melodic stimuli in human cortex.
Meanformula imageSEM Spearman correlation coefficient (formula image) between neural activity and the speech envelope, for each region of interest. Left: stimulus containing lyrics. Right: melody. The mean is computed across all channels in each area. Stars denote the significance of the difference in means (t-test), *formula image, **formula image.

Similar articles

Cited by

References

    1. Stevens K (2000) Acoustic phonetics, volume 30. The MIT press.
    1. Rosen S (1992) Temporal information in speech: acoustic, auditory and linguistic aspects. Philos Trans R Soc Lond, B, Biol Sci 336: 367–73. - PubMed
    1. Zatorre RJ, Belin P, Penhune VB (2002) Structure and function of auditory cortex: music and speech. Trends Cogn Sci (Regul Ed) 6: 37–46. - PubMed
    1. Hickok G, Poeppel D (2007) The cortical organization of speech processing. Nat Rev Neurosci 8: 393–402. - PubMed
    1. Rauschecker JP, Scott SK (2009) Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nature Neuroscience 12: 718–24. - PMC - PubMed

Publication types