Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2012 Feb 21;109(8):E505-14.
doi: 10.1073/pnas.1113427109. Epub 2012 Feb 1.

Phoneme and word recognition in the auditory ventral stream

Affiliations
Meta-Analysis

Phoneme and word recognition in the auditory ventral stream

Iain DeWitt et al. Proc Natl Acad Sci U S A. .

Abstract

Spoken word recognition requires complex, invariant representations. Using a meta-analytic approach incorporating more than 100 functional imaging experiments, we show that preference for complex sounds emerges in the human auditory ventral stream in a hierarchical fashion, consistent with nonhuman primate electrophysiology. Examining speech sounds, we show that activation associated with the processing of short-timescale patterns (i.e., phonemes) is consistently localized to left mid-superior temporal gyrus (STG), whereas activation associated with the integration of phonemes into temporally complex patterns (i.e., words) is consistently localized to left anterior STG. Further, we show left mid- to anterior STG is reliably implicated in the invariant representation of phonetic forms and that this area also responds preferentially to phonetic sounds, above artificial control sounds or environmental sounds. Together, this shows increasing encoding specificity and invariance along the auditory ventral stream for temporally complex speech sounds.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Foci meeting inclusion criteria for length-based CS analyses (AC) and ALE-statistic maps for regions of significant concordance (DF) (p < 10−3, k > 150 cm3). Analyses show leftward bias and an anterior progression in peak effects with phoneme-length studies showing greatest concordance in left mid-STG (A and D; n = 14), word-length studies showing greatest concordance in left anterior STG (B and E; n = 16), and phrase-length analyses showing greatest concordance in left anterior STS (C and F; n = 19). Sample size is given with respect to the number of contrasts from independent experiments contributing to an analysis.
Fig. 2.
Fig. 2.
Foci meeting liberal inclusion criteria for lexically based word-length CS analyses (A and B) and ALE-statistic maps for regions of significant concordance (C and D) (p < 10−3, k > 150 cm3). Similar to the CS analyses in Fig. 1, a leftward bias and an anterior progression in peak effects are shown. Pseudoword studies show greatest concordance in left mid- to anterior STG (A and C; n = 13). Notably, the distribution of concordance effects is bimodal, peaking both in mid- (−60, −26, 6) and anterior (−56, −10, 2) STG. Real-word studies show greatest concordance in left anterior STG (B and D; n = 22).
Fig. 3.
Fig. 3.
Foci meeting inclusion criteria for length-based RS analyses (A and B) and ALE-statistic maps for regions of significant concordance (C and D) (p < 10−3, k > 150 cm3). Analyses show left lateralization and an anterior progression in peak effects with phoneme-length studies showing greatest concordance in left mid-STG (A and C; n = 12) and word-length studies showing greatest concordance in left anterior STG (B and D; n = 16). Too few studies exist for phrase-length analyses (n = 4).
Fig. 4.
Fig. 4.
Foci meeting inclusion criteria for length-pooled analyses (AC) and ALE-statistic maps for regions of significant concordance (DF) (p < 10−3, k > 150 cm3). Analyses show leftward bias in the CS (A and D; n = 49) and AS (C and F; n = 15) analyses and left lateralization in the IR (B and E; n = 11) analysis. Foci are color coded by stimulus length: phoneme length, red; word length, green; and phrase length, blue.
Fig. 5.
Fig. 5.
Flat-map presentation of ALE cluster overlap for (A) the CS analyses shown in Fig. 1, (B) the word-length lexical status analyses shown in Fig. 2, (C) the RS analyses shown in Fig. 3, and (D) the length-pooled analyses shown in Fig. 4. For orientation, prominent landmarks are shown on the left hemisphere of A, including the circular sulcus (CirS), central sulcus (CS), STG, and STS.
Fig. P1.
Fig. P1.
Meta-analyses of auditory word processing. Matched-control sound studies (AC), providing evidence for pattern specificity in speech processing, show a leftward bias and an anterior progression in peak effects with phoneme-length studies’ peak focus density in left mid-STG (A; n = 14), word-length studies’ peak density in left anterior STG (B; n = 16), and phrase-length studies’ peak density in left anterior STS (C; n = 19). Repetition-suppression studies (D and E) show left lateralization and an anterior progression in peak effects, with phoneme-length studies’ peak density in left mid-STG (D; n = 12) and word-length studies’ peak density in left anterior STG (E; n = 16). Peak density in studies of invariant representation (F; n = 11) and areal specialization (G; n = 15) is, respectively, left-lateralized and leftward-biased. Intensity represents activation likelihood estimation statistic value (P value for false discovery rate < 10−3, k > 150 cm3).

References

    1. Steinschneider M. Unlocking the role of the superior temporal gyrus for speech sound categorization. J Neurophysiol. 2011;105:2631–2633. - PubMed
    1. Brugge JF, Merzenich MM. Responses of neurons in auditory cortex of the macaque monkey to monaural and binaural stimulation. J Neurophysiol. 1973;36:1138–1158. - PubMed
    1. Bitterman Y, Mukamel R, Malach R, Fried I, Nelken I. Ultra-fine frequency tuning revealed in single neurons of human auditory cortex. Nature. 2008;451:197–201. - PMC - PubMed
    1. Griffiths TD, Warren JD. What is an auditory object? Nat Rev Neurosci. 2004;5:887–892. - PubMed
    1. Leaver AM, Rauschecker JP. Cortical representation of natural complex sounds: effects of acoustic features and auditory object category. J Neurosci. 2010;30:7604–7612. - PMC - PubMed

Publication types

LinkOut - more resources