Vocal tract representation in the recognition of cerebral palsied speech
- PMID: 22271873
- DOI: 10.1044/1092-4388(2011/11-0223)
Vocal tract representation in the recognition of cerebral palsied speech
Abstract
Purpose: In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine.
Method: Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a baseline model indicating a relatively low performance with traditional automatic speech recognition (ASR) using only acoustic data from dysarthric individuals. In the 2nd experiment, the authors used various measures of entropy (statistical disorder) to determine whether characteristics of dysarthric articulation can reduce uncertainty in features of dysarthric acoustics. These findings led to the 3rd experiment, in which recorded dysarthric articulation was directly encoded into the speech recognition process.
Results: The authors found that 18.3% of the statistical disorder in the acoustics of speakers with dysarthria can be removed if articulatory parameters are known. Using articulatory models reduces phoneme recognition errors relatively by up to 6% for speakers with dysarthria in speaker-dependent systems.
Conclusions: Articulatory knowledge is useful in reducing rates of error in ASR for speakers with dysarthria and in reducing statistical uncertainty of their acoustic signals. These findings may help to guide clinical decisions related to the use of ASR in the future.
Similar articles
-
Frequency of consonant articulation errors in dysarthric speech.Clin Linguist Phon. 2010 Oct;24(10):759-70. doi: 10.3109/02699206.2010.497238. Clin Linguist Phon. 2010. PMID: 20831376
-
Relationship between kinematics, F2 slope and speech intelligibility in dysarthria due to cerebral palsy.Clin Linguist Phon. 2012 Sep;26(9):806-22. doi: 10.3109/02699206.2012.706686. Clin Linguist Phon. 2012. PMID: 22876770
-
Comparison of speaking rate, articulation rate and alternating motion rate in dysarthric speakers.Folia Phoniatr Logop. 2006;58(2):114-31. doi: 10.1159/000089612. Folia Phoniatr Logop. 2006. PMID: 16479133
-
Studies of Chinese speakers with dysarthria: informing theoretical models.Folia Phoniatr Logop. 2010;62(3):92-6. doi: 10.1159/000287206. Epub 2010 Apr 29. Folia Phoniatr Logop. 2010. PMID: 20424463 Review.
-
Acoustic studies of dysarthric speech: methods, progress, and potential.J Commun Disord. 1999 May-Jun;32(3):141-80, 183-6; quiz 181-3, 187-9. doi: 10.1016/s0021-9924(99)00004-0. J Commun Disord. 1999. PMID: 10382143 Review.
Cited by
-
Consonantal Landmarks as Predictors of Dysarthria among English-Speaking Adults with Cerebral Palsy.Brain Sci. 2021 Nov 23;11(12):1550. doi: 10.3390/brainsci11121550. Brain Sci. 2021. PMID: 34942852 Free PMC article.
-
An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.J Speech Lang Hear Res. 2016 Feb;59(1):15-26. doi: 10.1044/2015_JSLHR-S-14-0112. J Speech Lang Hear Res. 2016. PMID: 26564030 Free PMC article.
-
Interarticulator coordination in children with and without cerebral palsy.Dev Neurorehabil. 2017 Jan;20(1):1-13. doi: 10.3109/17518423.2015.1022809. Epub 2015 Apr 23. Dev Neurorehabil. 2017. PMID: 25905558 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical