Classification of functional voice disorders based on phonovibrograms
- PMID: 20138486
- DOI: 10.1016/j.artmed.2010.01.001
Classification of functional voice disorders based on phonovibrograms
Abstract
Objective: This work presents a computer-aided method for automatically and objectively classifying individuals with healthy and dysfunctional vocal fold vibration patterns as depicted in clinical high-speed (HS) videos of the larynx.
Methods: By employing a specialized image segmentation and vocal fold movement visualization technique - namely phonovibrography - a novel set of numerical features is derived from laryngeal HS videos capturing the dynamic behavior and the symmetry of oscillating vocal folds. In order to assess the discriminatory power of the features, a support vector machine is applied to the preprocessed data with regard to clinically relevant diagnostic tasks. Finally, the classification performance of the learned nonlinear models is evaluated to allow for conclusions to be drawn about suitability of features and data resulting from different examination paradigms. As a reference, a second feature set is determined which corresponds to more traditional voice analysis approaches.
Results: For the first time an automatic classification of healthy and pathological voices could be obtained by analyzing the vibratory patterns of vocal folds using phonovibrograms (PVGs). An average classification accuracy of approximately 81% was achieved for 2-class discrimination with PVG features. This exceeds the results obtained through traditional voice analysis features. Furthermore, a relevant influence of phonation frequency on classification accuracy was substantiated by the clinical HS data.
Conclusion: The PVG feature extraction and classification approach can be assessed as being promising with regard to the diagnosis of functional voice disorders. The obtained results indicate that an objective analysis of dysfunctional vocal fold vibration can be achieved with considerably high accuracy. Moreover, the PVG classification method holds a lot of potential when it comes to the clinical assessment of voice pathologies in general, as the diagnostic support can be provided to the voice clinician in a timely and reliable manner. Due to the observed interdependency between phonation frequency and classification accuracy, in future comparative studies of HS recordings of oscillating vocal folds homogeneous frequencies should be taken into account during examination.
2010 Elsevier B.V. All rights reserved.
Similar articles
-
A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30. Artif Intell Med. 2016. PMID: 26597002
-
Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods.Comput Methods Programs Biomed. 2010 Sep;99(3):275-88. doi: 10.1016/j.cmpb.2010.01.004. Comput Methods Programs Biomed. 2010. PMID: 20138386
-
Phonovibrogram visualization of entire vocal fold dynamics.Laryngoscope. 2008 Apr;118(4):753-8. doi: 10.1097/MLG.0b013e318161f9e1. Laryngoscope. 2008. PMID: 18216742
-
Chaos in voice, from modeling to measurement.J Voice. 2006 Mar;20(1):2-17. doi: 10.1016/j.jvoice.2005.01.001. Epub 2005 Jun 20. J Voice. 2006. PMID: 15964740 Review.
-
Aerodynamics, voice quality, and laryngeal image analysis of normal and pathologic voices.Curr Opin Otolaryngol Head Neck Surg. 2004 Jun;12(3):166-73. doi: 10.1097/01.moo.0000122306.42961.44. Curr Opin Otolaryngol Head Neck Surg. 2004. PMID: 15167024 Review.
Cited by
-
GIRAFE: Glottal imaging dataset for advanced segmentation, analysis, and facilitative playbacks evaluation.Data Brief. 2025 Feb 8;59:111376. doi: 10.1016/j.dib.2025.111376. eCollection 2025 Apr. Data Brief. 2025. PMID: 40027255 Free PMC article. Review.
-
Advanced computing solutions for analysis of laryngeal disorders.Med Biol Eng Comput. 2019 Nov;57(11):2535-2552. doi: 10.1007/s11517-019-02031-9. Epub 2019 Sep 6. Med Biol Eng Comput. 2019. PMID: 31493281 Review.
-
Glottal Gap tracking by a continuous background modeling using inpainting.Med Biol Eng Comput. 2017 Dec;55(12):2123-2141. doi: 10.1007/s11517-017-1652-8. Epub 2017 May 27. Med Biol Eng Comput. 2017. PMID: 28550413
-
[Hoarseness: biomechanisms and quantitative laryngoscopy].HNO. 2014 Jul;62(7):541-52. doi: 10.1007/s00106-014-2868-7. HNO. 2014. PMID: 25056650 German.
-
Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings.Sci Rep. 2020 Jun 29;10(1):10517. doi: 10.1038/s41598-020-66405-y. Sci Rep. 2020. PMID: 32601277 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Research Materials