Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
- PMID: 26681977
- PMCID: PMC4670637
- DOI: 10.1155/2015/956249
Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
Abstract
The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.
Figures










References
-
- Omori K. Diagnosis of voice disorders. Japan Medical Association Journal. 2011;54(4):248–253.
-
- Amara F., Fezari M. Recent Advances in Biology, Medical Physics, Medical Chemistry, Biochemistry and Biomedical Engineering. 2013. Voice pathologies classification using GMM and SVM classifiers; p. p. 65.
-
- Henríquez P., Alonso J. B., Ferrer M. A., Travieso C. M., Godino-Llorente J. I., Díaz-de-María F. Characterization of healthy and pathological voice through measures based on nonlinear dynamics. IEEE Transactions on Audio, Speech, and Language Processing. 2009;17(6):1186–1195. doi: 10.1109/tasl.2009.2016734. - DOI
-
- Carvalho R. T. S., Cavalcante C. C., Cortez P. C. Wavelet transform and artificial neural networks applied to voice disorders identification. Proceedings of the 3rd World Congress on Nature and Biologically Inspired Computing (NaBIC '11); October 2011; Salamanca, Spain. IEEE; pp. 371–376. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical