J Speech Lang Hear Res. 2014 Aug;57(4):1191-205.
doi: 10.1044/2014_JSLHR-S-13-0086.

Hybridizing conversational and clear speech to investigate the source of increased intelligibility in speakers with Parkinson's disease

Kris Tjaden et al. J Speech Lang Hear Res. 2014 Aug.

Abstract

Purpose: A speech analysis-resynthesis paradigm was used to investigate segmental and suprasegmental acoustic variables explaining intelligibility variation for 2 speakers with Parkinson's disease (PD).

Method: Sentences were read in conversational and clear styles. Acoustic characteristics from clear sentences were extracted and applied to conversational sentences, yielding 6 hybridized versions of sentences in which segment durations, short-term spectrum, energy characteristics, or fundamental frequency characteristics for clear productions were applied individually or in combination to conversational productions. Listeners (N = 20) judged intelligibility in transcription and scaling tasks.
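The energy-only hybrid described in the Method can be illustrated with a minimal sketch. This is not the authors' resynthesis procedure: it assumes the two productions are available as NumPy arrays, uses non-overlapping RMS frames, and substitutes plain linear time normalization for the segment-level alignment summarized in Figure 1. The names `rms_envelope` and `apply_energy_contour` and the `frame_len` parameter are hypothetical.

```python
import numpy as np

def rms_envelope(signal, frame_len):
    """Frame-wise RMS energy over non-overlapping frames (trailing samples dropped)."""
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.sqrt(np.mean(frames ** 2, axis=1))

def apply_energy_contour(conv, clear, frame_len=160, eps=1e-8):
    """Impose the clear production's RMS contour on the conversational waveform.

    Assumption: linear time normalization stands in for phoneme-level alignment.
    """
    conv = np.asarray(conv, dtype=float)
    conv_env = rms_envelope(conv, frame_len)
    clear_env = rms_envelope(clear, frame_len)

    # Stretch or compress the clear envelope onto the conversational frame grid.
    src = np.linspace(0.0, 1.0, len(clear_env))
    dst = np.linspace(0.0, 1.0, len(conv_env))
    target_env = np.interp(dst, src, clear_env)

    # Per-frame gain that moves the conversational energy to the clear target.
    gain = target_env / (conv_env + eps)
    sample_gain = np.repeat(gain, frame_len)
    return conv[: len(sample_gain)] * sample_gain
```

Calling `apply_energy_contour(conv, clear)` returns a waveform that keeps the conversational timing and spectral content while following the clear production's frame-level energy; duration, short-term spectrum, and fundamental frequency would each need their own transfer step.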

Results: Intelligibility increases above conversation were more robust for transcription, but the pattern of intelligibility improvement was similar across tasks. For 1 speaker, hybridization involving only clear energy characteristics yielded an 8.7% improvement in transcription intelligibility above conversation. For the other speaker, hybridization involving clear spectrum yielded an 18% intelligibility improvement, whereas hybridization involving both clear spectrum and duration yielded a 13.4% improvement.

Conclusions: Not all production changes accompanying clear speech explain its improved intelligibility. Suprasegmental adjustments contributed to intelligibility improvements when segmental adjustments, as inferred from vowel space area, were not robust. Hybridization can be used to identify acoustic variables explaining intelligibility variation in mild dysarthria secondary to PD.
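Vowel space area, used above as the index of segmental adjustment, is commonly computed as the area of the polygon formed by corner vowels in the F1-F2 plane. The sketch below applies the shoelace formula; the function name `vowel_space_area` and the formant values are illustrative assumptions, not data from the study.

```python
def vowel_space_area(corner_vowels):
    """Polygon area via the shoelace formula.

    corner_vowels: list of (F2_Hz, F1_Hz) pairs ordered around the perimeter.
    Returns the area in Hz^2.
    """
    n = len(corner_vowels)
    twice_area = 0.0
    for i in range(n):
        x1, y1 = corner_vowels[i]
        x2, y2 = corner_vowels[(i + 1) % n]  # wrap around to close the polygon
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0

# Hypothetical corner-vowel formants (F2, F1) in Hz, for illustration only.
example_quadrilateral = [(2300, 300), (1800, 700), (1100, 750), (900, 350)]
area_hz2 = vowel_space_area(example_quadrilateral)
```

Comparing this area between a speaker's conversational and clear productions gives a rough measure of how much the vowel articulatory-acoustic working space expanded.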

Figures

Figure 1
Block diagram summarizing hybridization process. CNV = conversational; CLR = clear; ins = insert; del = delete; aux = auxiliary; D = duration; E = energy; I = intonation; S = short-term spectrum; HYB-DS = hybrid of duration and short-term spectra.
Figure 2
Selected amplitude-normalized hybrid stimuli as well as the amplitude-normalized conversational and clear productions for PDM01 are shown. The waveform display corresponds to the conversational production. In each panel, root-mean-square intensity is shown in gray, and the F0 time history is shown in black. The aspect ratio of the x-axis for the clear production differs from other panels to facilitate comparison of energy and F0 traces from the clear production to the hybrids as well as the conversational production.
Figure 3
Selected amplitude-normalized hybrid stimuli as well as the amplitude-normalized conversational and clear productions for PDM06 are shown.
Figure 4
Mean percentage correct scores and scale values pooled across sentences and listeners are reported. Vertical bars indicate ±1 SD. Results for conversational (CON) and clear stimuli are shown on the extremes of the x-axis, and results for hybrids are reported in the middle of the x-axis: duration (D); the combination of duration and short-term spectrum (DS); energy (E); intonation (I); prosody, defined as the combination of intonation, energy, and duration (IED); and short-term spectrum (S).
Figure 5
Transcription results for individual sentences are reported for PDM01.
Figure 6
Transcription results for individual sentences are reported for PDM06. White symbols indicate sentences for which the duration/spectrum hybrid was associated with the best intelligibility among the six hybrid variants as well as improved intelligibility above conversational (n = 3). Black symbols indicate sentences for which the spectrum hybrid was associated with the best intelligibility among the six hybrids as well as improved intelligibility relative to conversational (n = 6).
