Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr 2;5(2):ENEURO.0252-17.2018.
doi: 10.1523/ENEURO.0252-17.2018. eCollection 2018 Mar-Apr.

Sensorimotor Representation of Speech Perception. Cross-Decoding of Place of Articulation Features during Selective Attention to Syllables in 7T fMRI

Affiliations

Sensorimotor Representation of Speech Perception. Cross-Decoding of Place of Articulation Features during Selective Attention to Syllables in 7T fMRI

Mario E Archila-Meléndez et al. eNeuro. .

Abstract

Sensorimotor integration, the translation between acoustic signals and motoric programs, may constitute a crucial mechanism for speech. During speech perception, the acoustic-motoric translations include the recruitment of cortical areas for the representation of speech articulatory features, such as place of articulation. Selective attention can shape the processing and performance of speech perception tasks. Whether and where sensorimotor integration takes place during attentive speech perception remains to be explored. Here, we investigate articulatory feature representations of spoken consonant-vowel (CV) syllables during two distinct tasks. Fourteen healthy humans attended to either the vowel or the consonant within a syllable in separate delayed-match-to-sample tasks. Single-trial fMRI blood oxygenation level-dependent (BOLD) responses from perception periods were analyzed using multivariate pattern classification and a searchlight approach to reveal neural activation patterns sensitive to the processing of place of articulation (i.e., bilabial/labiodental vs. alveolar). To isolate place of articulation representation from acoustic covariation, we applied a cross-decoding (generalization) procedure across distinct features of manner of articulation (i.e., stop, fricative, and nasal). We found evidence for the representation of place of articulation across tasks and in both tasks separately: for attention to vowels, generalization maps included bilateral clusters of superior and posterior temporal, insular, and frontal regions; for attention to consonants, generalization maps encompassed clusters in temporoparietal, insular, and frontal regions within the right hemisphere only. Our results specify the cortical representation of place of articulation features generalized across manner of articulation during attentive syllable perception, thus supporting sensorimotor integration during attentive speech perception and demonstrating the value of generalization.

Keywords: 7T fMRI; MVPA-based cross-decoding; place of articulation; selective attention; sensorimotor; speech perception.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Stimuli and experimental paradigm. A, Spoken stimuli matrix and articulatory properties. The 18 syllables were selected according to the place of articulation (i.e., bilabial/labiodental and alveolar) and manner of articulation (i.e., stop, fricative, and nasal). B, Experimental procedure and task description. Example of a typical match-to-sample trial during attend to vowels task. Subjects received instructions per block, in which they attended to consonants or vowels, respectively, and conducted a match-to-sample decision within slow-event related trials. Each block started with the visual presentation of a task cue (i.e., attention target vowel or consonant), indicating which task to perform in the next 27 trials. Each trial started with a fixation period in which a fixation cross was presented at the center of screen, together with a syllable sound (340 ms). After a jittered ISI (jitter: 6–8 s) a visual cue (i.e., a written letter, vowel, or consonant) was presented for 2 s, followed by the immediate subject’s response by pressing a button either with the right index finger (for match trials) or middle index finger (for mismatch trials). The response was followed by a jittered ISI (4–10 s) to complete the jittered ITI period (12–16 s) before the next trial started. C, Schematic representation of the fMRI acquisition sequence and its relationship to the syllable sounds presented to the participants. Vow, attentd to vowels task; Con, attend to consonants task; Syll, syllable; TA, time of acquisition; TR, time for repetition; Phase, opposite phase encoding volumes acquired for distortion correction.
Figure 2.
Figure 2.
Brain areas that show speech sound processing during attend to vowels and attend to consonants tasks. Functional maps depicting the overall pattern of sound-evoked cortical responses during performance of (A) attend to vowels (i.e., all sounds attend to vowels > baseline) and (B) attend to consonants task (i.e., all sounds attend to consonants > baseline); p value at cluster threshold (p-clust) < 0.05. Maps are visualized on inflated and aligned group-averaged representations of the left (LH) and right (RH) hemispheres of the fourteen subjects (light gray, gyri; dark gray, sulci). Color scale indicates average β values.
Figure 3.
Figure 3.
Cortical representation of place of articulation features, generalized across manner of articulation. Generalization maps depicting classification accuracies during (A) attend to vowels and (B) attend to consonants tasks. Insula cluster represents the outline (i.e., while line) of the largest continuous uncorrected cluster from the differences between the two tasks; p value at cluster-size threshold (pclust) < 0.05. Searchlight radius 8.25 mm. Maps are visualized on inflated and aligned group-averaged representations of the left (LH) and right (RH) hemispheres of the fourteen subjects (light gray, gyri; dark gray, sulci). Color scale indicates classification accuracy (Acc) percentages.
Figure 4.
Figure 4.
Differences in classification of place of articulation features, generalized across manner of articulation. Maps depicting the difference in the classification accuracies between attend to vowels and attend to consonants tasks. The largest continuous cluster (i.e., area, 134 mm2) was found in the left anterior insula, outlined in white; p value uncorrected (puncorr) < 0.05. Searchlight radius 8.25 mm. Maps are visualized on inflated and aligned group-averaged representations of the left (LH) and right (RH) hemispheres of the fourteen subjects (light gray, gyri; dark gray, sulci). Yellow-orange clusters show larger classification accuracies during the attend to vowels task, and green-blue clusters show lager classification accuracies during attend to consonants task. Please note that the color scale does not directly relates to the colors used in Figure 3. Color scale indicates differences in the classification accuracy (Acc) percentages.
Figure 5.
Figure 5.
Cortical representation of place of articulation features, generalized across manner of articulation, pooling all trails from two tasks. Generalization maps depicting classification accuracies calculated by pooling all trails from attend to vowels and attend to consonants tasks. Corrected for multiple comparison with cluster-size thresholding. Insula cluster represents the outline (i.e., white line) of the largest continuous uncorrected cluster from the differences between the two tasks; p value at cluster-size threshold (pclust) < 0.05. Searchlight radius 8.25 mm. Maps are visualized on inflated and aligned group-averaged representations of the left (LH) and right (RH) hemispheres of the fourteen subjects (light gray, gyri; dark gray, sulci). Color scale indicates classification accuracy (Acc) percentages.

References

    1. Arsenault JS, Buchsbaum BR (2015) Distributed neural representations of phonological features during speech perception. J Neurosci 35:634–642. - PMC - PubMed
    1. Ashburner J, Friston KJ (2005) Unified segmentation. Neuroimage 26:839–851. 10.1016/j.neuroimage.2005.02.018 - DOI - PubMed
    1. Baldauf D, Desimone R (2014) Neural mechanisms of object-based attention. Science 344:424–427. 10.1126/science.1247003 - DOI - PubMed
    1. Baldo JV, Wilkins DP, Ogar J, Willock S, Dronkers NF (2011) Role of the precentral gyrus of the insula in complex articulation. Cortex 47:800–807. 10.1016/j.cortex.2010.07.001 - DOI - PubMed
    1. Binder JR, Desai RH, Graves WW, Conant LL (2009) Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cereb Cortex 19:2767–2796. 10.1093/cercor/bhp055 - DOI - PMC - PubMed

Publication types

LinkOut - more resources