Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors
- PMID: 14765711
- DOI: 10.1109/TBME.2003.820386
Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors
Abstract
It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders--including glottic cancer--under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.
Similar articles
-
Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters.IEEE Trans Biomed Eng. 2006 Oct;53(10):1943-53. doi: 10.1109/TBME.2006.871883. IEEE Trans Biomed Eng. 2006. PMID: 17019858
-
Discrimination of pathological voices using a time-frequency approach.IEEE Trans Biomed Eng. 2005 Mar;52(3):421-30. doi: 10.1109/TBME.2004.842962. IEEE Trans Biomed Eng. 2005. PMID: 15759572
-
Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.J Voice. 2016 Nov;30(6):757.e7-757.e19. doi: 10.1016/j.jvoice.2015.08.010. Epub 2015 Oct 27. J Voice. 2016. PMID: 26522263
-
Automatic detection of laryngeal pathologies in records of sustained vowels by means of mel-frequency cepstral coefficient parameters and differentiation of patients by sex.Folia Phoniatr Logop. 2009;61(3):146-52. doi: 10.1159/000219950. Epub 2009 Jul 1. Folia Phoniatr Logop. 2009. PMID: 19571549 Review.
-
A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders.J Voice. 2019 Nov;33(6):947.e11-947.e33. doi: 10.1016/j.jvoice.2018.07.014. Epub 2018 Oct 11. J Voice. 2019. PMID: 30316551 Review.
Cited by
-
Pathological speech signal analysis and classification using empirical mode decomposition.Med Biol Eng Comput. 2013 Jul;51(7):811-21. doi: 10.1007/s11517-013-1051-8. Epub 2013 Mar 5. Med Biol Eng Comput. 2013. PMID: 23460198
-
Modulation Spectra Morphological Parameters: A New Method to Assess Voice Pathologies according to the GRBAS Scale.Biomed Res Int. 2015;2015:259239. doi: 10.1155/2015/259239. Epub 2015 Oct 18. Biomed Res Int. 2015. PMID: 26557656 Free PMC article.
-
A new approach: information gain algorithm-based k-nearest neighbors hybrid diagnostic system for Parkinson's disease.Phys Eng Sci Med. 2021 Jun;44(2):511-524. doi: 10.1007/s13246-021-01001-6. Epub 2021 Apr 14. Phys Eng Sci Med. 2021. PMID: 33852120
-
Advances in laryngeal imaging.Eur Arch Otorhinolaryngol. 2009 Oct;266(10):1509-20. doi: 10.1007/s00405-009-1050-4. Epub 2009 Jul 19. Eur Arch Otorhinolaryngol. 2009. PMID: 19618198 Review.
-
End-to-end deep learning classification of vocal pathology using stacked vowels.Laryngoscope Investig Otolaryngol. 2023 Aug 31;8(5):1312-1318. doi: 10.1002/lio2.1144. eCollection 2023 Oct. Laryngoscope Investig Otolaryngol. 2023. PMID: 37899847 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical