Auditory-model based robust feature selection for speech recognition
- PMID: 20136182
- DOI: 10.1121/1.3284545
Auditory-model based robust feature selection for speech recognition
Abstract
It is shown that robust dimension-reduction of a feature set for speech recognition can be based on a model of the human auditory system. Whereas conventional methods optimize classification performance, the proposed method exploits knowledge implicit in the auditory periphery, inheriting its robustness. Features are selected to maximize the similarity of the Euclidean geometry of the feature domain and the perceptual domain. Recognition experiments using mel-frequency cepstral coefficients (MFCCs) confirm the effectiveness of the approach, which does not require labeled training data. For noisy data the method outperforms commonly used discriminant-analysis based dimension-reduction methods that rely on labeling. The results indicate that selecting MFCCs in their natural order results in subsets with good performance.
Similar articles
-
A computer model of auditory efferent suppression: implications for the recognition of speech in noise.J Acoust Soc Am. 2010 Feb;127(2):943-54. doi: 10.1121/1.3273893. J Acoust Soc Am. 2010. PMID: 20136217
-
Statistical modeling of speech Poincaré sections in combination of frequency analysis to improve speech recognition performance.Chaos. 2010 Sep;20(3):033106. doi: 10.1063/1.3463722. Chaos. 2010. PMID: 20887046
-
Noise-robust acoustic signature recognition using nonlinear Hebbian learning.Neural Netw. 2010 Dec;23(10):1252-63. doi: 10.1016/j.neunet.2010.07.003. Epub 2010 Jul 23. Neural Netw. 2010. PMID: 20655704
-
Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone speech.J Acoust Soc Am. 2012 Feb;131(2):1536-46. doi: 10.1121/1.3672706. J Acoust Soc Am. 2012. PMID: 22352523
-
Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures.J Acoust Soc Am. 2008 Dec;124(6):3989-4000. doi: 10.1121/1.2997436. J Acoust Soc Am. 2008. PMID: 19206822
MeSH terms
LinkOut - more resources
Full Text Sources