Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Feb;127(2):69-78.
doi: 10.1177/0003489417743518. Epub 2017 Dec 10.

Predicting Intelligibility Deficit in Dysphonic Speech with Cepstral Peak Prominence

Affiliations

Predicting Intelligibility Deficit in Dysphonic Speech with Cepstral Peak Prominence

Keiko Ishikawa et al. Ann Otol Rhinol Laryngol. 2018 Feb.

Abstract

Objectives: The aim of this study was to examine the potential of cepstral peak prominence (CPP) for predicting the intelligibility deficit in dysphonic speech.

Methods: Sentences from Hearing-in-Noise Test were recorded from 18 speakers with dysphonia and 18 speakers with normal voice. These samples were presented to 60 adults with normal hearing in quiet and noise at signal to noise ratio of +0 dB. Intelligibility was measured by orthographic transcription. Cepstral peak prominence was measured for all samples. Correlation between CPP and intelligibility score was examined.

Results: Intelligibility was significantly lower in dysphonic speech than normal speech in the presence of background noise. The correlation between CPP and intelligibility score was moderate when the intelligibility scores were averaged per speaker.

Conclusions: Cepstral peak prominence only moderately predicts intelligibility deficit in dysphonic speech. Accordingly, CPP alone is not sufficient for describing the deficit.

Keywords: acoustic analysis; background noise; cepstral peak prominence; dysphonia; intelligibility.

PubMed Disclaimer

Conflict of interest statement

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
Cepstral peak prominence of (left) normal and (right) breathy voice.
Figure 2.
Figure 2.
Correlation plot of cepstral peak prominence and perceptual ratings of dysphonic voice quality.
Figure 3.
Figure 3.
Scatter plots showing the relationship between cepstral peak prominence (CPP) and intelligibility scores in (left) noisy and (right) quiet listening conditions. The open circles show averaged scores at the sentence level. The score for each sentence is an average of 15 listeners. The black circles show averaged scores at speaker and listener levels. The scores shown here are averages of 10 sentences and 15 listeners for each speaker. The horizontal error bars show the standard deviations of CPP. The vertical error bars show the standard deviations of intelligibility score.

Similar articles

Cited by

References

    1. Jacobson BH, Johnson A, Grywalski C, et al. The voice handicap index (VHI): development and validation. Am J Speech Lang Pathol. 1997;6(3):66–70.
    1. Ishikawa K, Kelchner L, Rao M, et al. The impact of dysphonia as predicted by self-report, perceptual, and acoustic measures: is intelligibility at risk? Paper presented at: 10th International Conference Advances in Quantitative Laryngology; 2013.
    1. Smith E, Taylor M, Mendoza M, Barkmeier J, Lemke J, Hoffman H. Spasmodic dysphonia and vocal fold paralysis: outcomes of voice problems on work-related functioning. J Voice. 1998;12(2):223–232. - PubMed
    1. Miller GA, Nicely PE. An analysis of perceptual confusions among some English consonants. J Acoust Soc Am. 1955;27(2):338–352.
    1. Pearsons K, Bennett R, Fidell S. Speech Levels in Various Noise Environments. Washington, DC: US Environmental Protection Agency; 1977.

LinkOut - more resources