Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec 13;9(1):19066.
doi: 10.1038/s41598-019-55271-y.

Phonetic relevance and phonemic grouping of speech in the automatic detection of Parkinson's Disease

Affiliations

Phonetic relevance and phonemic grouping of speech in the automatic detection of Parkinson's Disease

Laureano Moro-Velazquez et al. Sci Rep. .

Abstract

Literature documents the impact of Parkinson's Disease (PD) on speech but no study has analyzed in detail the importance of the distinct phonemic groups for the automatic identification of the disease. This study presents new approaches that are evaluated in three different corpora containing speakers suffering from PD with two main objectives: to investigate the influence of the different phonemic groups in the detection of PD and to propose more accurate detection schemes employing speech. The proposed methodology uses GMM-UBM classifiers combined with a technique introduced in this paper called phonemic grouping, that permits observation of the differences in accuracy depending on the manner of articulation. Cross-validation results reach accuracies between 85% and 94% with AUC ranging from 0.91 to 0.98, while cross-corpora trials yield accuracies between 75% and 82% with AUC between 0.84 and 0.95, depending on the corpus. This is the first work analyzing the generalization properties of the proposed approaches employing cross-corpora trials and reaching high accuracies. Among the different phonemic groups, results suggest that plosives, vowels and fricatives are the most relevant acoustic segments for the detection of PD with the proposed schemes. In addition, the use of text-dependent utterances leads to more consistent and accurate models.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Waveforms and spectrograms of a speaker with PD (newly diagnosed) and a control speaker pronouncing the word [petaka]. Obtained from the Neurovoz corpus. Red dot-lines mark the first four formants calculated with Praat software. (A) Idiopathic PD female speaker. Age: 59. UPDRS: 9. Span: 720 ms. (B) Control female speaker. Age: 59. Span: 857 ms
Figure 2
Figure 2
Waveforms and spectrograms of a speaker with PD (in an advanced stage) and a control speaker pronouncing the word [petaka]. Obtained from the Neurovoz corpus. Red dot-lines mark the first four formants calculated with Praat software. (A) Idiopathic PD female speaker. Age: 85. UPDRS: 47. Span: 810 ms. (B) Control female speaker. Age: 83. Span: 780 ms.
Figure 3
Figure 3
First proposed approach. Phonemic grouping methodology is applied to the parkinsonian corpus (raw-phon).
Figure 4
Figure 4
Second proposed approach. Phonemic grouping methodology is applied to the parkinsonian and UBM corpora (phon-phon).
Figure 5
Figure 5
Third proposed approach. Phonemic grouping methodology is applied to the UBM corpus (phon-raw).
Figure 6
Figure 6
Representation of Gaussians in the third approach (phon-raw). The GMM-UBM in the example contains 7 Gaussians modelling 2 features. Plosive grouping was applied to the UBM which was adapted with all the frames from a DDK task.
Figure 7
Figure 7
Diagram of trials. In the diagram, the classifiers are GMM-UBM where the UBM is trained with Albayzin. (A) Scheme of cross-validation trials (11 folds). The classifiers can be referred to any type of phonemic grouping (fricative, liquid, nasal, plosive or vowels) or approach (baseline, raw-phon, phon-phon or phon-raw). (B) Scheme of cross-corpora trials. The classifiers can be referred to any type of phonemic grouping. Only the baseline and the proposed approach leading to the best results in cross-validation trials were used in cross-corpora trials.
Figure 8
Figure 8
Best accuracies (A) and AUC (B). Results are referred to the three proposed approaches (marked as 1, 2 or 3) and speech tasks, where mon stands for monologues and cross for cross-corpora trials.

References

    1. Pfeiffer, R. F., Wszolek, Z. K. & Ebadi, M. Parkinson’s Disease (CRC Press, 2013).
    1. Hughes AJ, Daniel SE, Ben-Shlomo Y, Lees AJ. The accuracy of diagnosis of parkinsonian syndromes in a specialist movement disorder service. Brain. 2002;125:861–870. doi: 10.1093/brain/awf080. - DOI - PubMed
    1. Pujols, J. et al. Small molecule inhibits alpha-synuclein aggregation, disrupts amyloid fibrils, and prevents degeneration of dopaminergic neurons. Proc. Natl. Acad. Sci. (2018). - PMC - PubMed
    1. Ishihara LS, Cheesbrough A, Brayne C, Schrag A. Estimated life expectancy of parkinson”s patients compared with the uk population. J. Neurol. Neurosurg. & Psychiatry. 2007;78:1304–1309. doi: 10.1136/jnnp.2006.100107. - DOI - PMC - PubMed
    1. Moro Velázquez, L. Towards the differential evaluation of Parkinson’s Disease by means of voice and speech processing. Ph.D. thesis, ETSIS Telecomunicacion - Universidad Politecnica de Madrid (2018).

Publication types