Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods
- PMID: 20138386
- DOI: 10.1016/j.cmpb.2010.01.004
Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods
Abstract
The clinical diagnosis of voice disorders is based on examination of the rapidly moving vocal folds during phonation (f0: 80-300Hz) with state-of-the-art endoscopic high-speed cameras. Commonly, analysis is performed in a subjective and time-consuming manner via slow-motion video playback and exhibits low inter- and intra-rater reliability. In this study an objective method to overcome this drawback is presented being based on Phonovibrography, a novel image analysis technique. For a collective of 45 normophonic and paralytic voices the laryngeal dynamics were captured by specialized Phonovibrogram features and analyzed with different machine learning algorithms. Classification accuracies reached 93% for 2-class and 73% for 3-class discrimination. The results were validated by subjective expert ratings given the same diagnostic criteria. The automatic Phonovibrogram analysis approach exceeded the experienced raters' classifications by 9%. The presented method holds a lot of potential for providing reliable vocal fold diagnosis support in the future.
Copyright 2010 Elsevier Ireland Ltd. All rights reserved.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources