. 2000 Apr-Dec;57(2-4):152-69.

doi: 10.1159/000028469.

Modeling and perception of 'gesture reduction'

R Carré¹, P L Divenyi

Affiliations

PMID: 10992136
PMCID: PMC1360169
DOI: 10.1159/000028469

Modeling and perception of 'gesture reduction'

R Carré et al. Phonetica. 2000 Apr-Dec.

. 2000 Apr-Dec;57(2-4):152-69.

doi: 10.1159/000028469.

Authors

R Carré¹, P L Divenyi

Affiliation

¹ ENST, Unité Associée au CNRS, Paris, France. carre@tsi.enst.fr

PMID: 10992136
PMCID: PMC1360169
DOI: 10.1159/000028469

Abstract

The phenomenon of vowel reduction is investigated by modeling 'gesture reduction' with the use of the Distinctive Region Model (DRM). First, a definition is proposed for the term gesture, i.e. an acoustically efficient command aimed at deforming, in the time domain, the area function of the vocal tract. Second, tests are reported on the perception of vowel-to-vowel transitions obtained with reduced gestures. These tests show that a dual representation of formant transitions is required to explain the reduction phenomenon: the trajectory in the F(1)-F(2) plane and the time course of the formant changes. The results also suggest that time-domain integration of the trajectories constitutes an integral part of the auditory processing of transitions. Perceptual results are also discussed in terms of the acoustic traces of DRM gestures.

PubMed Disclaimer

Figures

**Fig. 1.**
Schematic diagram of the vocal tract deformation gestures for an [iV₂i] sequence. The vocal tract shape of [i] is shown as the solid line, whereas the one for the (arbitrary) V₂ is shown as the broken line. The gesture for [iV₂] is indicated by the solide arrow (1 and associated 1) and that for the [V₂i] by the broken arrow (2 and associated 2).

**Fig. 2.**
Schematic representation of the eight [iV₂i] sequences with different degrees of constriction, synthesized to be used as stimuli in experiment 1a. a DRM command amplitude (arbitrary units) as a function of time. b F₁ and F₂ transitions corresponding to the gesture in a, shown in a spectrogram plot. The temporal representation of F₃ is also shown (dotted line) for the case in which the [a] target is reached. c F₁-F₂ plot of the eight formant trajectories in b. Note that all V₂ vowels (shown as points) fall on the [ia] trajectory and can be interpreted as incomplete [iai] transitions, except for the rightmost point. The time labels on some of the points refer to the time of return to the final [i] vowel, before completing the [ia] transition, that is, the time value of vowel reduction.

**Fig. 3.**
Experiments 1a and 1b. a Average labeling results (and standard deviation) of 5 listeners in experiment 1a for V₂ in a [iV₂i] context. Ordinate: percent responses; V₂ = /a/ (filled diamonds), V₂ =/ε/ (squares), V₂ = /e/ (triangles). Abscissa: point of return of the transition to [i] prior to reaching [a]. b Average labeling results (and standard deviation) of 1 listener in experiment 1a for V₂ in a [iV₂i] context. c Average labeling results (and standard deviation) of 5 listeners in experiment 1b for a steady-state V₂ vowel. Ordinate: percent responses; V₂ = /a/ (filled diamonds), V₂ = /ε/ (squares), V₂ = /e/ (triangles). Abscissa: F₁-F₂ value of the V₂ vowel (fig. 2c) shown as the point of return of the transition to [i] prior to reaching [a], had the vowel been the midpoint of an [iV₂i] transition.

**Fig. 4.**
Experiment 1c. a Spectrogram representation of the five different temporal patterns of the formant transitions; note that the transition for all tokens reaches the same V₂ and that the total duration of the transition is always constant at 150 ms. b F₁-F₂ plane representation of the transitions: note that the extreme value, i.e. the V₂ vowel corresponds to the formant values of the vowel [ε]. c Average labeling results (and standard deviation) of 5 listeners in experiment 1c for V₂ in a [iai] context with changing transition durations. Ordinate: percent responses; V₂ = /a/ (filled diamonds), V₂ = /ε/ (squares), V₂ = /e/ (triangles). Abscissa: Token numbers corresponding to the five different transition slopes (a).

**Fig. 5.**
Average labeling results (and standard deviation) in experiment 1d using a V₂ value corresponding to a steady-state [ε]. Ordinate: percent responses; V₂ = /a/ (filled diamonds), V₂ =/ ε / (squares). Abscissa: total duration of the token.

**Fig. 6.**
Experiment 2a. a Temporal representation of the F₁ and F₂ transitions of eight of the ten tokens used in the [aV₂a] experiment where only two tokens completed the transition to the vowel [y]. The other six had their transitions progressively cut back, i.e. their formant frequencies returned toward those of [a] after reaching various V₂ endpoint vowels on the trajectory. The temporal representations of F₃ is also shown (dotted line) for the case in which the [y] target is reached. b F₁-F₂ plane representation of the [aya] transition for the eight tokens shown above. Note that, in contrast to the [iai] trajectory (fig. 2c), the [aya] trajectory is curved. c Results (average and standard deviation) of experiment 2a: Percent /aya/ (filled diamonds), /aia/ (squares), /ala/ (triangles) responses as a function of the duration of [y] or the transition cutback point (i.e. the point of return to [a]) where 0 ms refers to the condition in which the vowel [y] is reached but the transition immediately returns toward [a]. Note that 100% /y/ responses were obtained only for the 30-ms ‘positive cutback’ condition, i.e. for the condition in which there was a 30-ms steady-state [y] before the transition actually took a turn back to [a]. Also, note that the intersubject variability is much larger than the one observed in the subtests of experiment 1.

**Fig. 7.**
Experiments 2b and 2c. a Temporal representation of the F₁ and F₂ transitions of the eight tokens used in the [aV₂a] experiment where only two tokens completed the transition to the vowel [y]. The other six had their transitions progressively cut back,i.e. their formant frequencies returned toward those of [a] after reaching various V₂ endpoint vowels on the trajectory. The temporal representation of F₃ is also shown (dotted line) for the case in which the [y] target iy reached. b F₁-F₂ plane representation of the [aya] transition for the eight tokens used in experiment 2b (solid line). Note that, compared with the [aya] trajectory (broken line) shown in figure 6b, the trajectory is symmetrically curved. c Results of experiment 2b: Percent /aya/ (filled diamonds), /aØa/ (squares), /aœa/ (triangles) responses as a function of the duration of [y] or the transition cutback point (i.e. the point of return to [a]) where 0 ms refers to the condition in which the vowel [y] is reached but the transition immediately returns toward [a]. Note that /aya/ responses were obtained for the -20 ms ‘negative cutback’ condition, i.e. for the condition in which there was a 20 ms before reaching the target [y]. Average results of 5 subjects. d Results in experiment 2c: Labeling of steady-state V₂ vowels. Ordinate: percent responses V₂ = /y/ (filled diamonds), V₂ = /Ø/ (squares), V₂ = /œ/ (triangles). Abscissa: F₁-F₂ value of the V₂ vowel shown as the point of return of the transition to [a] prior to reaching [y], had the vowel been the midpoint of an [aV₂a] transition.

**Fig. 8.**
Experiment 2d. a Temporal representation of the F₁ and F₂ transitions of the seven tokens used in the [aV₂a] experiment where six tokens completed the transition to the vowel [i]. The last one had its transition cut back, i.e. its formant frequencies returned toward those of [a] after reaching V₂ endpoint vowel on the trajectory. The temporal representation of F₃ is also shown (dotted line) for the case in which the [i] target is reached. b F₁-F₂ plane representation of the [aia] transition for the seven tokens used in experiment 2d (solid line). Note that, compared with the [aya] trajectory (broken line) shown in figure 7b, the trajectory is also curved but reaches [i]. c Results of experiment 2d: Percent /aia/ (filled diamonds) and /aya/ (squares) responses as a function of the duration of [i] or the transition cutback point (i.e. the point of return to [a]) where 0 ms refers to the condition in which the vowel [y] is reached but the transition immediately returns toward [a].

See this image and copyright information in PMC

Cited by

Relation of vocal tract shape, formant transitions, and stop consonant identification.
Story BH, Bunton K. Story BH, et al. J Speech Lang Hear Res. 2010 Dec;53(6):1514-28. doi: 10.1044/1092-4388(2010/09-0127). Epub 2010 Jul 19. J Speech Lang Hear Res. 2010. PMID: 20643794 Free PMC article.
Perception of complete and incomplete formant transitions in vowels.
Divenyi P. Divenyi P. J Acoust Soc Am. 2009 Sep;126(3):1427-39. doi: 10.1121/1.3167482. J Acoust Soc Am. 2009. PMID: 19739756 Free PMC article.

References

1. d’Alessandro C, Castellengo M. The pitch of short duration vibrato tones. J. acoust. Soc. Am. 1994;95:1617–1630.
1. Badin P, Fant G.198453–107.Notes on the vocal tract computations. Q. Prog. Status Rep., Speech Transm. Lab., R. Inst. Technol., Stockh., No. 2/3
1. Beautemps D.1993. Récupération des gestes de la parole ὰ partir de trajectoires formantiques: identification de cibles vocaliques non atteintes et modèles pour les profils sagittaux des consonnes fricatives; thèse Institut National Polytechnique, Grenoble
1. Browman C, Goldstein L. Ewan, Anderson, Phonol. Yb. Cambridge University Press; Cambridge: 1986. Towards an articulatory phonology; pp. 219–252.
1. Brownlee SA. The role of sentence stress in vowel reduction and formant undershoot: a study of lab speech and informal spontaneous speech. University of Texas; Austin: 1996. PhD thesis.

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

R01 AG007998-10A1/AG/NIA NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Modeling and perception of 'gesture reduction'

Affiliation

Modeling and perception of 'gesture reduction'

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources