Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions
- PMID: 30604404
- PMCID: PMC6559869
- DOI: 10.3758/s13423-018-1551-5
Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions
Abstract
Efficient speech perception requires listeners to maintain an exquisite tension between stability of the language architecture and flexibility to accommodate variation in the input, such as that associated with individual talker differences in speech production. Achieving this tension can be guided by top-down learning mechanisms, wherein lexical information constrains interpretation of speech input, and by bottom-up learning mechanisms, in which distributional information in the speech signal is used to optimize the mapping to speech sound categories. An open question for theories of perceptual learning concerns the nature of the representations that are built for individual talkers: do these representations reflect long-term, global exposure to a talker or rather only short-term, local exposure? Recent research suggests that when lexical knowledge is used to resolve a talker's ambiguous productions, listeners disregard previous experience with a talker and instead rely on only recent experience, a finding that is contrary to predictions of Bayesian belief-updating accounts of perceptual adaptation. Here we use a distributional learning paradigm in which lexical information is not explicitly required to resolve ambiguous input to provide an additional test of global versus local exposure accounts. Listeners completed two blocks of phonetic categorization for stimuli that differed in voice-onset-time, a probabilistic cue to the voicing contrast in English stop consonants. In each block, two distributions were presented, one specifying /g/ and one specifying /k/. Across the two blocks, variance of the distributions was manipulated to be either narrow or wide. The critical manipulation was order of the two blocks; half of the listeners were first exposed to the narrow distributions followed by the wide distributions, with the order reversed for the other half of the listeners. The results showed that for earlier trials, the identification slope was steeper for the narrow-wide group compared to the wide-narrow group, but this difference was attenuated for later trials. The between-group convergence was driven by an asymmetry in learning between the two orders such that only those in the narrow-wide group showed slope movement during exposure, a pattern that was mirrored by computational simulations in which the distributional statistics of the present talker were integrated with prior experience with English. This pattern of results suggests that listeners did not disregard all prior experience with the talker, and instead used cumulative exposure to guide phonetic decisions, which raises the possibility that accommodating a talker's phonetic signature entails maintaining representations that reflect global experience.
Keywords: Computational models; Distributional learning; Perceptual learning; Speech perception.
Figures



Similar articles
-
Talker-specific influences on phonetic category structure.J Acoust Soc Am. 2015 Aug;138(2):1068-78. doi: 10.1121/1.4927489. J Acoust Soc Am. 2015. PMID: 26328722
-
Listener sensitivity to individual talker differences in voice-onset-time.J Acoust Soc Am. 2004 Jun;115(6):3171-83. doi: 10.1121/1.1701898. J Acoust Soc Am. 2004. PMID: 15237841
-
Listeners are maximally flexible in updating phonetic beliefs over time.Psychon Bull Rev. 2018 Apr;25(2):718-724. doi: 10.3758/s13423-017-1376-7. Psychon Bull Rev. 2018. Retraction in: Psychon Bull Rev. 2020 Aug;27(4):819. doi: 10.3758/s13423-020-01765-0. PMID: 28924946 Free PMC article. Retracted.
-
Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.Psychol Rev. 2015 Apr;122(2):148-203. doi: 10.1037/a0038695. Psychol Rev. 2015. PMID: 25844873 Free PMC article. Review.
-
Why are listeners hindered by talker variability?Psychon Bull Rev. 2024 Feb;31(1):104-121. doi: 10.3758/s13423-023-02355-6. Epub 2023 Aug 14. Psychon Bull Rev. 2024. PMID: 37580454 Free PMC article. Review.
Cited by
-
From first encounters to longitudinal exposure: a repeated exposure-test paradigm for monitoring speech adaptation.Front Psychol. 2024 May 30;15:1383904. doi: 10.3389/fpsyg.2024.1383904. eCollection 2024. Front Psychol. 2024. PMID: 38873525 Free PMC article.
-
Computational Modeling of an Auditory Lexical Decision Experiment Using DIANA.Lang Speech. 2023 Sep;66(3):564-605. doi: 10.1177/00238309221111752. Epub 2022 Aug 24. Lang Speech. 2023. PMID: 36000386 Free PMC article.
-
The Role of the Right Hemisphere in Processing Phonetic Variability Between Talkers.Neurobiol Lang (Camb). 2021 Feb 1;2(1):138-151. doi: 10.1162/nol_a_00028. eCollection 2021. Neurobiol Lang (Camb). 2021. PMID: 37213418 Free PMC article. Review.
-
A second chance for a first impression: Sensitivity to cumulative input statistics for lexically guided perceptual learning.Psychon Bull Rev. 2021 Jun;28(3):1003-1014. doi: 10.3758/s13423-020-01840-6. Epub 2021 Jan 14. Psychon Bull Rev. 2021. PMID: 33443706
-
SingleMALD: Investigating practice effects in auditory lexical decision.Behav Res Methods. 2025 Apr 2;57(5):136. doi: 10.3758/s13428-025-02628-z. Behav Res Methods. 2025. PMID: 40175775 Free PMC article.
References
-
- Hillenbrand J, Getty LA, Clark MJ, & Wheeler K (1995). Acoustic characteristics of American English vowels. Journal of the Acoustical society of America, 97(5), 3099–3111. - PubMed
-
- Jongman A, Wayland R, & Wong S (2000). Acoustic characteristics of English fricatives. Journal of the Acoustical Society of America, 108(3), 1252–1263. - PubMed
-
- Kleinschmidt DF (2017). beliefupdatr: Belief updating for phonetic adaptation. R package version 0.0.3.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources