Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 12;60(7):1919-1929.
doi: 10.1044/2017_JSLHR-S-16-0012.

The Effect of Background Noise on Intelligibility of Dysphonic Speech

Affiliations

The Effect of Background Noise on Intelligibility of Dysphonic Speech

Keiko Ishikawa et al. J Speech Lang Hear Res. .

Abstract

Purpose: The aim of this study is to determine the effect of background noise on the intelligibility of dysphonic speech and to examine the relationship between intelligibility in noise and an acoustic measure of dysphonia: cepstral peak prominence (CPP).

Method: A study of speech perception was conducted using speech samples from 6 adult speakers with typical voice and 6 adult speakers with dysphonia. Speech samples were presented to 30 listeners with typical hearing in 3 noise conditions: quiet, signal-to-noise ratio (SNR)+5, and SNR+0. Intelligibility scores were obtained via orthographic transcription as the percentage of correctly identified words. Speech samples were acoustically analyzed using CPP, and the correlation between the CPP measurements and intelligibility scores was examined.

Results: The intelligibility of both typical and dysphonic speech was reduced as the level of background noise increased. The reduction was significantly greater in dysphonic speech. A strong correlation was noted between CPP and intelligibility score at SNR+0.

Conclusions: Dysphonic speech is relatively harder to understand in the presence of background noise as compared with typical speech. CPP may be a useful predictor of this intelligibility deficit. Future work is needed to confirm these findings with a larger number of speakers and speech materials with known predictability.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Mean intelligibility scores of normal and dysphonic groups in all noise conditions. The values are in percentages. Error bars indicate standard errors. SNR = signal-to-noise ratio.
Figure 2.
Figure 2.
Scatter plots of cepstral peak prominence (CPP) and intelligibility scores in quiet, SNR+5, and SNR+0 conditions. Intelligibility scores are averaged across listeners. The shaded area shows the 95% confidence interval for the mean value of the intelligibility score for a given value of CPP for all values of CPP. SNR = signal-to-noise ratio.
Figure 3.
Figure 3.
Scatter plots of cepstral peak prominence (CPP) and intelligibility scores in quiet, SNR+5, and SNR+0 conditions. Intelligibility scores are averaged across sentences per speaker and listeners. The shaded area shows the 95% confidence interval for the mean value of intelligibility score for a given value of CPP for all values of CPP. SNR = signal-to-noise ratio.

Similar articles

Cited by

References

    1. Adams S., Dykstra A., Jenkins M., & Jog M. (2008). Speech-to-noise levels and conversational intelligibility in hypophonia and Parkinson's disease. Journal of Medical Speech-Language Pathology, 16, 165–172.
    1. Awan S. N., Roy N., Jetté M. E., Meltzner G. S., & Hillman R. E. (2010). Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: Comparisons with auditory-perceptual judgements from the CAPE-V. Clinical Linguistics & Phonetics, 24, 742–758. - PubMed
    1. Baken R. J., & Orlikoff R. F. (2000). Clinical measurement of speech and voice. Boston, MA: Cengage Learning.
    1. Benninger M. S., Alessi D., Archer S., Bastian R., Ford C., Koufman J., … Woo P. (1996). Vocal fold scarring: Current concepts and management. Otolaryngology—Head and Neck Surgery, 115, 474–482. - PubMed
    1. Bickley C. (1982). Acoustic analysis and perception of breathy vowels. Speech Communication Group Working Papers, 1, 71–81.

LinkOut - more resources