Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Apr 1:4:148.
doi: 10.3389/fpsyg.2013.00148. eCollection 2013.

Constraints on the transfer of perceptual learning in accented speech

Affiliations

Constraints on the transfer of perceptual learning in accented speech

Frank Eisner et al. Front Psychol. .

Abstract

The perception of speech sounds can be re-tuned through a mechanism of lexically driven perceptual learning after exposure to instances of atypical speech production. This study asked whether this re-tuning is sensitive to the position of the atypical sound within the word. We investigated perceptual learning using English voiced stop consonants, which are commonly devoiced in word-final position by Dutch learners of English. After exposure to a Dutch learner's productions of devoiced stops in word-final position (but not in any other positions), British English (BE) listeners showed evidence of perceptual learning in a subsequent cross-modal priming task, where auditory primes with devoiced final stops (e.g., "seed", pronounced [si:t(h)]), facilitated recognition of visual targets with voiced final stops (e.g., SEED). In Experiment 1, this learning effect generalized to test pairs where the critical contrast was in word-initial position, e.g., auditory primes such as "town" facilitated recognition of visual targets like DOWN. Control listeners, who had not heard any stops by the speaker during exposure, showed no learning effects. The generalization to word-initial position did not occur when participants had also heard correctly voiced, word-initial stops during exposure (Experiment 2), and when the speaker was a native BE speaker who mimicked the word-final devoicing (Experiment 3). The readiness of the perceptual system to generalize a previously learned adjustment to other positions within the word thus appears to be modulated by distributional properties of the speech input, as well as by the perceived sociophonetic characteristics of the speaker. The results suggest that the transfer of pre-lexical perceptual adjustments that occur through lexically driven learning can be affected by a combination of acoustic, phonological, and sociophonetic factors.

Keywords: cross-modal priming; foreign-accented speech; perceptual learning; speech.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Priming effects (reaction times in the unrelated minus the related prime type) are shown for each of the four word types, for experimental and control groups, in Experiment 1. The group differences for the /d/-final and /d/-initial word types illustrate learning for word-final position and generalization to word-initial position, respectively. Starred differences denote a significant interaction of group and prime type (related vs. unrelated).
Figure 2
Figure 2
Priming effects in Experiment 2, showing a group difference for the /d/-final words type but no other word type. Starred differences denote a significant interaction of group and prime type (related vs. unrelated).
Figure 3
Figure 3
Priming effects in Experiment 3. As in Experiment 2, there was a group difference for /d/-final words type but no other word type. Starred differences denote a significant interaction of group and prime type (related vs. unrelated).
Figure 4
Figure 4
Spectrograms of the words “pod” and “pot” as produced by the native English (left panels) and the Dutch speaker (right panels).
Figure 5
Figure 5
Duration measurements for both speakers across all minimal pairs with word-final stops in the experiments, shown separately for the voiced and unvoiced members of a pair (e.g., “pod” vs. “pot”). Duration distinctions made by the native speaker were reduced or absent for the Dutch speaker in the case of preceding vowel and closure, but mostly preserved for burst and aspiration. Error bars represent standard deviations.

References

    1. Adank P., Evans B. G., Stuart-Smith J., Scott S. K. (2009). Comprehension of familiar and unfamiliar native accents under adverse listening conditions. J. Exp. Psychol. Hum. Percept. Perform. 35, 520–52910.1037/a0013552 - DOI - PubMed
    1. Adank P., Janse E. (2009). Perceptual learning of time-compressed and natural fast speech. J. Acoust. Soc. Am. 126, 2649–265910.1121/1.3216914 - DOI - PubMed
    1. Baayen R. H., Piepenbrock R., Gulikers L. (1995). The CELEX Lexical Database. Philadelphia, PA: Linguistic Data Consortium, University of Pennsylvania
    1. Bent T., Buchwald A., Pisoni D. B. (2009). Perceptual adaptation and intelligibility of multiple talkers for two types of degraded speech. J. Acoust. Soc. Am. 126, 2660–266910.1121/1.3212930 - DOI - PMC - PubMed
    1. Bertelson P., Vroomen J., de Gelder B. (2003). Visual recalibration of auditory speech identification: a McGurk aftereffect. Psychol. Sci. 14, 592–59710.1046/j.0956-7976.2003.psci_1470.x - DOI - PubMed

LinkOut - more resources