J Speech Lang Hear Res. 2021 Sep 14;64(9):3361-3381.

doi: 10.1044/2021_JSLHR-21-00021. Epub 2021 Jul 26.

Auditory Feedback Is Used for Adaptation and Compensation in Speech Timing

Robin Karlin¹, Chris Naber¹, Benjamin Parrell¹,²

Affiliations

¹ Waisman Center, University of Wisconsin-Madison.
² Department of Communication Sciences and Disorders, University of Wisconsin-Madison.

PMID: 34310188
PMCID: PMC8642089
DOI: 10.1044/2021_JSLHR-21-00021

Robin Karlin et al. J Speech Lang Hear Res. 2021.

Abstract

Purpose: Real-time altered feedback has demonstrated a key role for auditory feedback in both online feedback control and in updating feedforward control for future utterances. The aim of this study was to examine adaptation in response to temporal perturbation using real-time perturbation of ongoing speech.

Method: Twenty native English speakers with no reported history of speech or hearing disorders participated in this study. The study consisted of four word blocks, using the phrases "a capper," "a gapper," "a sapper," and "a zapper" (due to issues with the implementation of perturbation, "gapper" was excluded from analysis). In each block, participants completed a baseline phase (30 trials of veridical feedback), a ramp phase (feedback perturbation increasing to maximum over 30 trials), a hold phase (60 trials with perturbation held at maximum), and a washout phase (30 trials, feedback abruptly returned to veridical feedback). Word-initial consonant targets (voice onset time for /k, g/ and fricative duration for /s, z/) were lengthened, and the following stressed vowel (/æ/) was shortened.

Results: Overall, speakers did not adapt the production of their consonants but did lengthen their vowel production in response to shortening. Vowel lengthening showed continued aftereffects during the early portion of the washout phase. Although speakers did not adapt absolute consonant durations, consonant duration was reduced as a proportion of the total syllable duration. This is consistent with previous research that suggests that speakers attend to proportional durations rather than absolute durations.

Conclusion: These results indicate that speakers actively monitor proportional durations and update the temporal dynamics of planning units extending beyond a single segment.
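The block design described in the Method portion of the abstract can be laid out as a per-trial perturbation schedule. The following is a minimal sketch, not the authors' implementation: the phase lengths come from the abstract, but the linear shape of the ramp, the normalized 0 to 1 perturbation scale, and all function and variable names are assumptions made for illustration.

```python
# Phase lengths as stated in the abstract (trials per phase, per word block).
PHASES = [("baseline", 30), ("ramp", 30), ("hold", 60), ("washout", 30)]


def perturbation_schedule(phases=PHASES):
    """Return one (phase, fraction_of_max_perturbation) pair per trial.

    Assumption: the ramp rises linearly and reaches the maximum on its
    last trial; the abstract only says it increases to maximum over 30 trials.
    """
    schedule = []
    for name, n_trials in phases:
        for i in range(n_trials):
            if name == "ramp":
                level = (i + 1) / n_trials
            elif name == "hold":
                level = 1.0
            else:  # baseline and washout use veridical feedback
                level = 0.0
            schedule.append((name, level))
    return schedule


def consonant_proportion(consonant_ms, syllable_ms):
    """Consonant target duration as a proportion of total syllable duration."""
    if syllable_ms <= 0:
        raise ValueError("syllable duration must be positive")
    return consonant_ms / syllable_ms


schedule = perturbation_schedule()
```

Under these assumptions a block contains 150 trials. The `consonant_proportion` helper illustrates the proportional measure reported in the Results: if the syllable grows longer because the vowel lengthens while the consonant duration stays fixed, the returned proportion falls even though the absolute consonant duration is unchanged.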

Figures

**Figure 1.**
Examples of the input (top) and output (bottom) signals from the hold phase, including the lag between signals: (a) “a capper,” (b) “a sapper,” and (c) “a zapper.” Target segment durations are given in milliseconds below the spectrograms. Rectangles above the durations in the input signal indicate the time warp periods: Black indicates the signal that underwent time dilation; unfilled indicates the hold period; gray indicates the catch-up period. Dynamic range in the spectrogram has been set to make segment boundaries clearly visible; the additional noise visible in the output signal is due to the inclusion of white noise in playback to prevent participants from hearing their true, unaltered speech.

**Figure 2.**
Change from baseline in consonant target duration. (a) By phase, averaged across participants (means ± standard error), including only data used in the model. (b) Behavior throughout the experiment, where each data point represents five trials, averaged across participants (means ± standard error). The dashed line indicates the beginning of the ramp phase, and the shaded area indicates the hold phase.

**Figure 3.**
Change from baseline in /æ/ duration. (a) By phase, averaged across participants (means ± standard error); stars indicate that that phase is significantly different from baseline for that word. (b) Behavior throughout the experiment, where each data point represents five trials, averaged across participants (means ± standard error). Note the consistency in vowel lengthening across participants, compared to the highly variable consonant target behavior.

**Figure 4.**
Change from baseline of proportion consonant target in initial consonant–vowel–consonant (CVC) syllable. (a) Averaged across participants (means ± standard error); stars indicate that that phase is significantly different from baseline for that word. (b) Behavior throughout the experiment, where each data point represents five trials, averaged across participants (means ± standard error).

**Figure 5.**
Change from baseline in /p/ duration. (a) By phase, averaged across participants (means ± standard error); stars indicate that that phase is significantly different from baseline for that word. (b) Behavior throughout the experiment, where each data point represents five trials, averaged across participants (means ± standard error).

**Figure 6.**
Change from baseline in /ɚ/ duration. (a) By phase, averaged across participants (means ± standard error); stars indicate that that phase is significantly different from baseline for that word. (b) Behavior throughout the experiment, where each data point represents five trials, averaged across participants (means ± standard error).

**Figure A1.**
Changes in consonant target duration, by participant. Dashed lines indicate insufficient perturbation.

**Figure A2.**
Changes in vowel duration, by participant. Dashed lines indicate insufficient perturbation.

**Figure A3.**
Changes in proportion consonant target, by participant. Dashed lines indicate insufficient perturbation (either consonant target or vowel target).
