Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May;141(5):3022.
doi: 10.1121/1.4982247.

Relative contributions of acoustic temporal fine structure and envelope cues for lexical tone perception in noise

Affiliations

Relative contributions of acoustic temporal fine structure and envelope cues for lexical tone perception in noise

Beier Qi et al. J Acoust Soc Am. 2017 May.

Abstract

Previous studies have shown that lexical tone perception in quiet relies on the acoustic temporal fine structure (TFS) but not on the envelope (E) cues. The contributions of TFS to speech recognition in noise are under debate. In the present study, Mandarin tone tokens were mixed with speech-shaped noise (SSN) or two-talker babble (TTB) at five signal-to-noise ratios (SNRs; -18 to +6 dB). The TFS and E were then extracted from each of the 30 bands using Hilbert transform. Twenty-five combinations of TFS and E from the sound mixtures of the same tone tokens at various SNRs were created. Twenty normal-hearing, native-Mandarin-speaking listeners participated in the tone-recognition test. Results showed that tone-recognition performance improved as the SNRs in either TFS or E increased. The masking effects on tone perception for the TTB were weaker than those for the SSN. For both types of masker, the perceptual weights of TFS and E in tone perception in noise was nearly equivalent, with E playing a slightly greater role than TFS. Thus, the relative contributions of TFS and E cues to lexical tone perception in noise or in competing-talker maskers differ from those in quiet and those to speech perception of non-tonal languages.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Average tone-recognition scores across the 20 normal-hearing listeners in the SSN conditions. (Left) Tone-recognition scores as a function of SNR in TFS. Each line represents the data for SNR in E. (Middle) Tone-recognition scores as a function of SNR in E. Each line represents the data for SNR in TFS. (Right) Contour plot of the average tone-recognition scores. The abscissa and the ordinate represent SNR in E and in TFS, respectively. Color represents the tone-recognition performance as indicated by the color bar on the right.
FIG. 2.
FIG. 2.
Average tone-recognition scores across the 20 normal-hearing listeners in the TTB conditions. Conventions same as Fig. 1.
FIG. 3.
FIG. 3.
Correlation of E and TFS between the original and noise-mixed tone tokens. (Top) Correlation coefficients (r) of E and TFS between the original tone tokens and those mixed with two types of noise. The solid red lines and the dotted blue lines represent correlation coefficients for the E and TFS, respectively. The circle and triangle symbols represent SSN and TTB conditions, respectively. (Middle) Image plot of the mean correlation coefficients between those of E and TFS for the SSN conditions. The color of each element of the image represents the mean of the correlation coefficient of E at a particular SNR and that of TFS at a particular SNR. The white contour lines demarcate regions of similar values of the mean correlation coefficients. (Bottom) Same as the middle panel, but for the TTB conditions.

References

    1. Apoux, F. , and Healy, E. W. (2013). “ A glimpsing account of the role of temporal fine structure information in speech recognition,” Adv. Exp. Med. Biol. 787, 119–126.10.1007/978-1-4614-1590-9_14 - DOI - PubMed
    1. Apoux, F. , Yoho, S. E. , Youngdahl, C. L. , and Healy, E. W. (2013). “ Role and relative contribution of temporal envelope and fine structure cues in sentence recognition by normal-hearing listeners,” J. Acoust. Soc. Am. 134, 2205–2212.10.1121/1.4816413 - DOI - PMC - PubMed
    1. Boersma, P. , and Weenink, D. (2016). “ PRAAT: Doing phonetics by computer (version 6.0.21) [computer program],” http://www.praat.org (Last viewed 7 October 2016).
    1. Cabrera, L. , Tsao, F.-M. , Gnansia, D. , Bertoncini, J. , and Lorenzi, C. (2014). “ The role of spectro-temporal fine structure cues in lexical-tone discrimination for French and Mandarin listeners,” J. Acoust. Soc. Am. 136, 877–882.10.1121/1.4887444 - DOI - PubMed
    1. Darwin, C. (2005). “ Globally change the pitch and duration using a script for the PRAAT program,” http://www.lifesci.sussex.ac.uk/home/Chris_Darwin/Praatscripts/Fodurndyn... (Last viewed 23 April 2017).

Publication types