. 2008;65(1-2):19-44.

doi: 10.1159/000130014. Epub 2008 May 28.

Spectral integration of dynamic cues in the perception of syllable-initial stops

Robert Allen Fox¹, Ewa Jacewicz, Lawrence L Feth

Affiliations

PMID: 18523365
PMCID: PMC2849108
DOI: 10.1159/000130014

Spectral integration of dynamic cues in the perception of syllable-initial stops

Robert Allen Fox et al. Phonetica. 2008.

. 2008;65(1-2):19-44.

doi: 10.1159/000130014. Epub 2008 May 28.

Authors

Robert Allen Fox¹, Ewa Jacewicz, Lawrence L Feth

Affiliation

¹ Department of Speech and Hearing Science, The Ohio State University, Columbus, Ohio 43210-1002, USA. fox.2@osu.edu

PMID: 18523365
PMCID: PMC2849108
DOI: 10.1159/000130014

Abstract

The present experiments examine the potential role of auditory spectral integration and spectral center of gravity (COG) effects in the perception of initial formant transitions in the syllables [da]-[ga] and [t(h)a]-[k(h)a]. Of interest is whether the place distinction for stops in these syllables can be cued by a 'virtual F3 transition' in which the percept of a frequency transition is produced by a dynamically changing COG. Listeners perceived the virtual F3 transitions comparably with actual F3 transitions although the former were less salient a cue. However, in a separate experiment, static 'virtual F3 bursts' were not as effective as actual F3 bursts in cueing the alveolar-velar place distinction. These results indicate that virtual F3 transitions can provide phonetic information to the perceptual system and that auditory spectral integration (completed by the central auditory system) may play a significant role in speech perception.

PubMed Disclaimer

Figures

**Fig. 1**
Schematic representation of the actual F3 transition series; note that the frequency of the F3 onset varies from 1,907 to 2,596 Hz.

**Fig. 2**
The perceived frequency of the actual (left) and the virtual F3 transition (right) as predicted by the IWAIF model. The steady-state portions of F3 are also included.

**Fig. 3**
Schematic representation of the virtual F3 transition series; note that the center frequencies of the two 50-ms noise-excited resonances remain constant.

**Fig. 4**
Spectrograms of the endpoint stimuli (a, c) and an intermediate stimulus (b) for the actual [da]-[ga] series. The versions of these tokens containing virtual F3 transitions are displayed in d, f and e, respectively.

**Fig. 5**
Identification functions of responses to the actual F3 transition and the virtual F3 transition series. Values along the abscissa represent the onset F3 frequency used in the synthesis of the actual formant transition or, for the virtual transitions, the effective onset frequency of the IWAIF-modeled spectral COG for the two noise-excited resonances.

**Fig. 6**
Identification functions (rising/not rising) of responses to the actual F3 transitions and the virtual F3 transition series. Values along the abscissa represent the onset F3 frequency used in the synthesis of the actual formant transition or, for the virtual transitions, the effective onset frequency of the IWAIF-modeled COG for the two noise-excited resonances.

**Fig. 7**
Spectrograms of the endpoint stimuli (a, c) and an intermediate stimulus (b) for the actual [t^ha]-[k^ha] series. The versions of these tokens containing virtual F3 transitions are displayed in d, f and e, respectively.

**Fig. 8**
Identification functions of responses to the actual F3 transition voiceless series and the virtual F3 transition voiceless series. Values along the abscissa represent the onset F3 frequency used in the synthesis of the actual formant transition or, for the virtual transitions, the effective onset frequency of the IWAIF-modeled spectral COG for the two noise-excited resonances.

**Fig. 9**
Schematic representation of the actual F3 burst series; each stimulus token in the series uses the [t^ha]-[k^ha] base token.

**Fig. 10**
Schematic representation of the virtual F3 burst series; each stimulus token in the series uses the [t^ha]-[k^ha] base token.

**Fig. 11**
Identification functions of responses to the actual and virtual F3 burst series. Values along the abscissa represent the onset F3 frequency used in the synthesis of the actual F3 burst or, for the virtual F3 bursts, the effective onset frequency of the IWAIF-modeled spectral COG for the bursts centered at 1,907 and 2,861 Hz.

See this image and copyright information in PMC

References

1. Anantharaman JN, Krishnamurthy AK, Feth L. Intensity-weighted average of instantaneous frequency as a model for frequency discrimination. J acoust Soc Am. 1993;94:723–729. - PubMed
1. Assmann PF. The perception of back vowels: centre of gravity hypothesis. Q Jl exp Psychol. 1991;43:423–448. - PubMed
1. Beddor PS, Hawkins S. The influence of spectral prominence on perceived vowel quality. J acoust Soc Am. 1990;87:2684–2704. - PubMed
1. Bedrov YA, Chistovich LA, Sheikin RL. Frequency location of the ‘center of gravity’ of formants as a useful feature in vowel perception. Akust Zh . 24:480–486.
2. Soviet Phys Acoust. 1978;24:275–282.
1. Bladon A. Two-formant models of vowel perception: shortcomings and enhancements. Speech Commun. 1983;2:305–313.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

R01 DC006879/DC/NIDCD NIH HHS/United States

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Spectral integration of dynamic cues in the perception of syllable-initial stops

Affiliation

Spectral integration of dynamic cues in the perception of syllable-initial stops

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous