Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures

Jonathan Darch¹, Ben Milner, Saeed Vaseghi

PMID: 19206822
DOI: 10.1121/1.2997436

Comparative Study

Jonathan Darch et al. J Acoust Soc Am. 2008 Dec;124(6):3989-4000. doi: 10.1121/1.2997436.

Affiliation

¹ School of Computing Sciences, University of East Anglia, Norwich, United Kingdom.

Abstract

The aim of this work is to develop methods that enable acoustic speech features to be predicted from mel-frequency cepstral coefficient (MFCC) vectors as may be encountered in distributed speech recognition architectures. The work begins with a detailed analysis of the multiple correlation between acoustic speech features and MFCC vectors. This confirms the existence of correlation, which is found to be higher when measured within specific phonemes rather than globally across all speech sounds. The correlation analysis leads to the development of a statistical method of predicting acoustic speech features from MFCC vectors that utilizes a network of hidden Markov models (HMMs) to localize prediction to specific phonemes. Within each HMM, the joint density of acoustic features and MFCC vectors is modeled and used to make a maximum a posteriori prediction. Experimental results are presented across a range of conditions, such as with speaker-dependent, gender-dependent, and gender-independent constraints, and these show that acoustic speech features can be predicted from MFCC vectors with good accuracy. A comparison is also made against an alternative scheme that substitutes the higher-order MFCCs with acoustic features for transmission. This delivers accurate acoustic features but at the expense of a significant reduction in speech recognition accuracy.
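The two statistical steps named in the abstract, measuring the multiple correlation between an acoustic feature and MFCC vectors and then making a maximum a posteriori prediction from a joint density, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the form of the joint density, so a single joint Gaussian per model state is assumed here, and the HMM network that selects the phoneme-specific model is omitted. All function names are hypothetical. Under the Gaussian assumption, the MAP estimate of the acoustic feature x given an MFCC vector y is the conditional mean, mu_x + S_xy S_yy^-1 (y - mu_y).

```python
import numpy as np


def multiple_correlation(x, Y):
    """Multiple correlation R between a scalar feature x (n,) and vectors Y (n, d).

    R is the correlation between x and its best linear fit from Y.
    """
    Yc = Y - Y.mean(axis=0)
    xc = x - x.mean()
    # Least-squares linear fit of the centred feature from the centred vectors.
    w, *_ = np.linalg.lstsq(Yc, xc, rcond=None)
    fit = Yc @ w
    # R^2 is the fraction of the feature's variance explained by the fit.
    return float(np.sqrt(np.sum(fit ** 2) / np.sum(xc ** 2)))


def fit_joint_gaussian(X, Y):
    """Estimate mean and covariance of the joint vector [x, y].

    ASSUMPTION: one Gaussian per state; the abstract does not specify this.
    X has shape (n, dx) and Y has shape (n, dy).
    """
    Z = np.hstack([X, Y])
    return Z.mean(axis=0), np.cov(Z, rowvar=False)


def map_predict(y, mu, cov, dx):
    """MAP prediction of x from y under the joint Gaussian (conditional mean)."""
    mu_x, mu_y = mu[:dx], mu[dx:]
    S_xy = cov[:dx, dx:]
    S_yy = cov[dx:, dx:]
    # Solve S_yy a = (y - mu_y) rather than forming an explicit inverse.
    return mu_x + S_xy @ np.linalg.solve(S_yy, y - mu_y)
```

In a phoneme-localized scheme of the kind the abstract describes, one such mean and covariance pair would be estimated per model state, and the state chosen for each frame would determine which pair is passed to `map_predict`.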

Cited by

A novel approach for acoustic estimation of neck fluid volume between men and women.
Shokrollahi M, Rudzicz F, Vena D, Bradley TD, Yadollahi A. Med Biol Eng Comput. 2018 Jan;56(1):113-123. doi: 10.1007/s11517-017-1675-1. Epub 2017 Jul 5. PMID: 28676955
