Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods
- PMID: 20138386
- DOI: 10.1016/j.cmpb.2010.01.004
Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods
Abstract
The clinical diagnosis of voice disorders is based on examination of the rapidly moving vocal folds during phonation (f0: 80-300Hz) with state-of-the-art endoscopic high-speed cameras. Commonly, analysis is performed in a subjective and time-consuming manner via slow-motion video playback and exhibits low inter- and intra-rater reliability. In this study an objective method to overcome this drawback is presented being based on Phonovibrography, a novel image analysis technique. For a collective of 45 normophonic and paralytic voices the laryngeal dynamics were captured by specialized Phonovibrogram features and analyzed with different machine learning algorithms. Classification accuracies reached 93% for 2-class and 73% for 3-class discrimination. The results were validated by subjective expert ratings given the same diagnostic criteria. The automatic Phonovibrogram analysis approach exceeded the experienced raters' classifications by 9%. The presented method holds a lot of potential for providing reliable vocal fold diagnosis support in the future.
Copyright 2010 Elsevier Ireland Ltd. All rights reserved.
Similar articles
-
Classification of functional voice disorders based on phonovibrograms.Artif Intell Med. 2010 May;49(1):51-9. doi: 10.1016/j.artmed.2010.01.001. Artif Intell Med. 2010. PMID: 20138486
-
Phonovibrogram visualization of entire vocal fold dynamics.Laryngoscope. 2008 Apr;118(4):753-8. doi: 10.1097/MLG.0b013e318161f9e1. Laryngoscope. 2008. PMID: 18216742
-
Phonovibrography: mapping high-speed movies of vocal fold vibrations into 2-D diagrams for visualizing and analyzing the underlying laryngeal dynamics.IEEE Trans Med Imaging. 2008 Mar;27(3):300-9. doi: 10.1109/TMI.2007.903690. IEEE Trans Med Imaging. 2008. PMID: 18334426
-
[Phonovibrogram: vocal fold dynamics integrated within a single image].HNO. 2008 Dec;56(12):1207-12. doi: 10.1007/s00106-007-1541-9. HNO. 2008. PMID: 17503008 Review. German.
-
Vocal fold paresis and paralysis: what the thyroid surgeon should know.Surg Oncol Clin N Am. 2008 Jan;17(1):175-96. doi: 10.1016/j.soc.2007.10.007. Surg Oncol Clin N Am. 2008. PMID: 18177806 Review.
Cited by
-
Advanced computing solutions for analysis of laryngeal disorders.Med Biol Eng Comput. 2019 Nov;57(11):2535-2552. doi: 10.1007/s11517-019-02031-9. Epub 2019 Sep 6. Med Biol Eng Comput. 2019. PMID: 31493281 Review.
-
Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.PLOS Digit Health. 2024 May 30;3(5):e0000516. doi: 10.1371/journal.pdig.0000516. eCollection 2024 May. PLOS Digit Health. 2024. PMID: 38814939 Free PMC article.
-
Spatiotemporal analysis of vocal fold vibrations between children and adults.Laryngoscope. 2012 Nov;122(11):2511-8. doi: 10.1002/lary.23568. Epub 2012 Sep 10. Laryngoscope. 2012. PMID: 22965771 Free PMC article.
-
Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.medRxiv [Preprint]. 2024 Mar 20:2020.11.23.20235945. doi: 10.1101/2020.11.23.20235945. medRxiv. 2024. Update in: PLOS Digit Health. 2024 May 30;3(5):e0000516. doi: 10.1371/journal.pdig.0000516. PMID: 33501466 Free PMC article. Updated. Preprint.
-
Analysis of vocal fold function from acoustic data simultaneously recorded with high-speed endoscopy.J Voice. 2012 Nov;26(6):726-33. doi: 10.1016/j.jvoice.2012.02.001. Epub 2012 May 25. J Voice. 2012. PMID: 22632795 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources