Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct 25;62(10):3655-3666.
doi: 10.1044/2019_JSLHR-S-19-0048. Epub 2019 Sep 16.

Categorization in the Perception of Breathy Voice Quality and Its Relation to Voice Production in Healthy Speakers

Affiliations

Categorization in the Perception of Breathy Voice Quality and Its Relation to Voice Production in Healthy Speakers

Yeonggwang Park et al. J Speech Lang Hear Res. .

Abstract

Purpose Previous studies of speech articulation have shown that individuals who can perceive smaller differences between similar-sounding phonemes showed larger contrasts in their productions of those phonemes. Here, a similar relationship was examined between the perception and production of breathy voice quality. Method Twenty females with healthy voices were recruited to participate in both a voice production and a perception experiment. Each participant produced repetitions of a sustained vowel, and acoustic correlates of breathiness were calculated. Identification and discrimination tasks were performed with a series of synthetic stimuli along a breathiness continuum. Categorical boundary location and boundary width were obtained from the identification task as a measurement of perception of breathiness. Spearman's correlation analysis was performed to estimate associations between values of boundary location and width and the acoustic correlates of breathiness from the participants' voices. Results Significant correlations between boundary width (r = -.53 to -.6) and some acoustic correlates were found, but no significant relationships were observed between boundary location and the acoustic correlates. Conclusions Speakers with small boundary widths, which suggest higher perceptual precision in differentiating breathiness, had typical voices that were less breathy, as estimated with acoustic measures, compared to speakers with large boundary widths. Our findings may support a link between perception and production of breathy voice quality. Supplemental Material https://doi.org/10.23641/asha.9808478.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic of study hypotheses on the effects of boundary location (A) and boundary widths (B) on auditory targets. Acoustic space represents actual acoustic distances of the stimuli represented as white circles on the breathiness continuum. Perceptual space represents how listeners would perceive the stimuli, and shorter distances between stimuli would represent more perceptual similarity. The color green indicates stimuli that would be perceived as typical voices; purple indicates stimuli that would be perceived as breathy voices. Auditory targets represent acoustic regions that the listeners would target when they produce their typical or breathy voices.
Figure 2.
Figure 2.
Scatter plots and best linear fits of boundary location versus smoothed cepstral peak prominence (CPPS), harmonics-to-noise ratio (HNR), and high-to-low spectral ratio (HL ratio). Spearman's correlations are represented as r values. No significant correlations were found between boundary location and the acoustic measures.
Figure 3.
Figure 3.
Scatter plots and best linear fits of boundary width versus smoothed cepstral peak prominence (CPPS), harmonics-to-noise ratio (HNR), and high-to-low spectral ratio (HL ratio). Spearman's correlations are represented as r values. *p < .05.
Figure 4.
Figure 4.
Mean smoothed cepstral peak prominence (CPPS), harmonics-to-noise ratio (HNR), and high-to-low spectral ratio (HL ratio) of each group. Error bars indicate 95% confidence intervals. *Speakers with narrow boundaries had significantly higher CPPS and HNR than speakers with wide boundaries.
Figure 5.
Figure 5.
(Left) A scatter plot and a best linear fit of boundary width versus breathiness rating from the Consensus Auditory–Perceptual Evaluation of Voice with Spearman's correlation represented as an r value. (Right) Mean breathiness ratings of each group. Error bars indicate 95% confidence intervals. *Speakers with narrow boundaries had significantly lower breathiness ratings than speakers with wide boundaries.
Figure 6.
Figure 6.
Identification curves for all participants, divided into narrow and wide boundary groups (n = 10 each). The x-axes represent the stimuli number. The y-axes are omitted. Each curve starts with 0 identification percentage at Stimuli 1 and ends with 100 identification percentage at Stimuli 10.
Figure 7.
Figure 7.
Boundary discrimination and averaged nonboundary discrimination percentages of individual participants (red: narrow boundary group, blue: wide boundary group), connected by lines.

References

    1. American Speech-Language-Hearing Association. (2002). Consensus Auditory–Perceptual Evaluation of Voice (CAPE-V): ASHA Special Interest Division 3, Voice and Voice Disorders. Retrieved from https://www.asha.org/uploadedFiles/members/divs/D3CAPEVprocedures.pdf
    1. American Speech-Language-Hearing Association. (2005). Guidelines for manual pure-tone threshold audiometry [Guidelines]. Retrieved from http://www.asha.org/policy
    1. Arai T. (2006). Cue parsing between nasality and breathiness in speech perception. Acoustical Science and Technology, 27, 298–301.
    1. Belafsky P. C., Postma G. N., & Koufman J. A. (2002). Validity and reliability of the Reflux Symptom Index (RSI). Journal of Voice, 16(2), 274–277. - PubMed
    1. Best C. T., & Strange W. (1992). Effects of phonological and phonetic factors on cross-language perception of approximants. Journal of Phonetics, 20(3), 305–330.

Publication types

LinkOut - more resources