Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Mar;33(2):204-213.
doi: 10.1016/j.jvoice.2017.09.024. Epub 2017 Nov 20.

The Perception of Breathiness in the Voices of Pediatric Speakers

Affiliations

The Perception of Breathiness in the Voices of Pediatric Speakers

Lisa M Kopf et al. J Voice. 2019 Mar.

Abstract

Background: The perception of pediatric voice quality has been investigated using clinical protocols developed for adult voices and acoustic analyses designed to identify important physical parameters associated with normal and dysphonic pediatric voices. Laboratory investigations of adult dysphonia have included sophisticated methods, including a psychoacoustic approach that involves a single-variable matching task (SVMT), characterized by high inter- and intra-listener reliability, and analyses that include bio-inspired models of auditory perception that have provided valuable information regarding adult voice quality.

Objectives: To establish the utility of a psychoacoustic approach to the investigation of voice quality perception in the context of pediatric voices?

Methods: Six listeners judged the breathiness of 20 synthetic vowel stimuli using an SVMT. To support comparisons with previous data, stimuli were modeled after four pediatric speakers and synthesized using Klatt with five parameter settings that influence the perception of breathiness. The population average breathiness judgments were modeled with acoustic measures of loudness ratio, pitch strength, and cepstral peak.

Results: Listeners reliably judged the perceived breathiness of pediatric voices, as with previous investigations of breathiness in adult dysphonic voices. Breathiness judgments were accurately modeled by loudness ratio (r2 = 0.93), pitch strength (r2 = 0.91), and cepstral peak (r2 = 0.82). Model accuracy was not affected significantly by including stimulus fundamental frequency and was slightly higher for pediatric than for adult voices.

Conclusions: The SVMT proved robust for pediatric voices spanning a wide range of breathiness. The data indicate that this is a promising approach for future investigation of pediatric voice quality.

Keywords: Breathiness; Listener perception; Matching task; Pediatric dysphonia.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Fourier spectra of AH-OQ for two synthetic talkers. Both talkers are based on the same natural voice and represent the two endpoints of the breathy continuum (continuum positions 1 and 5) where position 1 represents a voice with a low degree of breathiness and position 5 represents a voice with high degree of breathiness. Values in the inset of each panel highlight the acoustic indices described the text.
Figure 2
Figure 2
Matching NSR by continuum position. The mean ± SE are given, averaged over all listeners.
Figure 3
Figure 3
Matching NSR by loudness ratio in dB. The mean ± SE are given, averaged over all listeners. Black symbols are for pediatric voices, with symbol type separating the four different talkers. Grey symbols correspond to adult voices from Shrivastav et al. (2011). Lines with the same shading scheme represent linear regression. In both cases, goodness of fit was high (pediatric voices: r2 = 0.93; adult voices: r2 = 0.72).
Figure 4
Figure 4
Matching NSR by pitch strength. The mean ± SE are given, averaged over all listeners. Black symbols are for pediatric voices, with symbol type separating the four different talkers. Grey symbols correspond to adult voices from Shrivastav et al. (2011). Lines with the same shading scheme represent linear regression. In both cases, goodness of fit was high (pediatric voices: r2 = 0.91; adult voices: r2 = 0.71).
Figure 5
Figure 5
Matching NSR by cepstral peak in dB. The mean ± SE are given, averaged over all listeners. Black symbols are for pediatric voices, with symbol type separating the four different talkers. Grey symbols correspond to adult voices from Shrivastav et al. (2011). Lines with the same shading scheme represent linear regression. In both cases, goodness of fit was high (pediatric voices: r2 = 0.82; adult voices: r2 = 0.91).

Similar articles

Cited by

References

    1. Carding PN, Roulstone S, Northstone K, Team AS. The prevalence of childhood dysphonia: a cross-sectional study. Journal of Voice. 2006;20(4):623–30. - PubMed
    1. Bhattacharyya N. The prevalence of pediatric voice and swallowing problems in the United States. The Laryngoscope. 2015;125(3):746–50. - PubMed
    1. Andrews M. Voice Treatment for Children and Adolescents. San Diego, CA: Singular Publishing Group, Inc; 2002.
    1. Ruddy BH, Sapienza CM. Treating Voice Disorders in the School-Based SettingWorking Within the Framework of IDEA. Language, Speech, and Hearing Services in Schools. 2004;35(4):327–32. - PubMed
    1. Connor N, Cohen S, Theis S, Thibeault S, Heatley D, Bless D. Attitudes of children with dysphonia. Journal of Voice. 2008;22(1):197–209. - PubMed

LinkOut - more resources