Review

Comput Intell Neurosci. 2017:2017:1945630.

doi: 10.1155/2017/1945630. Epub 2017 Mar 5.

Random Deep Belief Networks for Recognizing Emotions from Speech Signals

Guihua Wen¹, Huihui Li¹, Jubing Huang¹, Danyang Li¹, Eryang Xun¹

PMID: 28356908
PMCID: PMC5357547
DOI: 10.1155/2017/1945630

Guihua Wen et al. Comput Intell Neurosci. 2017.

Affiliation

¹ School of Computer Science and Engineering, South China University of Technology, Guangzhou, China.

Abstract

Human emotions can now be recognized from speech signals using machine learning methods; however, these methods achieve low recognition accuracy in real applications because their representations are not rich enough. Deep belief networks (DBNs) can automatically discover multiple levels of representation in speech signals. To make full use of this advantage, this paper presents an ensemble method of random deep belief networks (RDBN) for speech emotion recognition. It first extracts low-level features from the input speech signal and then uses them to construct many random subspaces. Each random subspace is fed to a DBN, which yields higher-level features that serve as input to a classifier that outputs an emotion label. The emotion labels from all classifiers are then fused by majority voting to decide the final emotion label for the input speech signal. Experimental results on benchmark speech emotion databases show that RDBN achieves better accuracy than the compared methods for speech emotion recognition.
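
The ensemble structure described in the abstract (random feature subspaces, one base learner per subspace, majority voting over the predicted labels) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the DBN and its classifier are replaced by a simple nearest-centroid learner so the example stays self-contained, and all function names, the toy feature vectors, and the two emotion labels are hypothetical.

```python
import random
from collections import Counter


def random_subspace(n_features, subspace_size, rng):
    """Pick a random subset of feature indices (one random subspace)."""
    return sorted(rng.sample(range(n_features), subspace_size))


def fit_centroids(X, y, idx):
    """Stand-in base learner: per-class mean over the selected features.

    Assumption: the paper trains a DBN plus a classifier at this step;
    a nearest-centroid model is swapped in purely for illustration.
    """
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(idx))
        for j, f in enumerate(idx):
            acc[j] += row[f]
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}


def predict_centroid(centroids, x, idx):
    """Return the class whose centroid is closest in the subspace."""
    def dist(c):
        return sum((x[f] - m) ** 2 for f, m in zip(idx, centroids[c]))
    return min(sorted(centroids), key=dist)


def majority_vote(labels):
    """Fuse the base learners' labels; the most frequent label wins."""
    return Counter(labels).most_common(1)[0][0]


def train_ensemble(X, y, n_members, subspace_size, seed=0):
    """Train one base learner per randomly drawn feature subspace."""
    rng = random.Random(seed)
    members = []
    for _ in range(n_members):
        idx = random_subspace(len(X[0]), subspace_size, rng)
        members.append((idx, fit_centroids(X, y, idx)))
    return members


def predict_ensemble(members, x):
    """Collect one vote per ensemble member and fuse them."""
    votes = [predict_centroid(c, x, idx) for idx, c in members]
    return majority_vote(votes)


# Hypothetical toy "low-level features" (4 per utterance) and labels.
X_toy = [
    [0.9, 0.8, 0.7, 0.9],
    [1.0, 0.9, 0.8, 1.0],
    [0.1, 0.2, 0.1, 0.0],
    [0.0, 0.1, 0.2, 0.1],
]
y_toy = ["angry", "angry", "neutral", "neutral"]

ensemble = train_ensemble(X_toy, y_toy, n_members=5, subspace_size=2)
print(predict_ensemble(ensemble, [0.95, 0.85, 0.75, 0.9]))
```

Ensemble size and subspace size are the two knobs the figure captions refer to (ensemble size and number of features); in this sketch they correspond to `n_members` and `subspace_size`.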

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Figure 1**
Structure of deep belief network.

**Figure 2**
Structure of the standard RBM.

**Figure 3**
Framework of RDBN for speech emotion recognition, illustrating the method to create the base classifiers for the ensemble through random subspace, DBN, and SVM, where the majority voting is applied to perform the fusion.

**Figure 4**
Accuracies (WA) vary with the number of features for each ensemble size on EMODB, aiming to find the optimal ensemble size and the number of features for RDBN on this database.

**Figure 5**
Accuracies (WA) vary with the number of features for each ensemble size on CASIA, aiming to find the optimal ensemble size and the number of features for RDBN on this database.

**Figure 6**
Accuracies (WA) vary with the number of features for each ensemble size on SAVEE, aiming to find the optimal ensemble size and the number of features for RDBN on this database.

**Figure 7**
Accuracies (WA) vary with the number of features for each ensemble size on the FAU database, aiming to find the optimal ensemble size and the number of features for RDBN on this database.

Cited by

An enhanced speech emotion recognition using vision transformer.
Akinpelu S, Viriri S, Adegun A. Sci Rep. 2024 Jun 7;14(1):13126. doi: 10.1038/s41598-024-63776-4. PMID: 38849422 Free PMC article.
Deep learning-based EEG emotion recognition: Current trends and future perspectives.
Wang X, Ren Y, Luo Z, He W, Hong J, Huang Y. Front Psychol. 2023 Feb 27;14:1126994. doi: 10.3389/fpsyg.2023.1126994. eCollection 2023. PMID: 36923142 Free PMC article. Review.
The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning.
Costantini G, Parada-Cabaleiro E, Casali D, Cesarini V. Sensors (Basel). 2022 Mar 23;22(7):2461. doi: 10.3390/s22072461. PMID: 35408076 Free PMC article.
Bidirectional parallel echo state network for speech emotion recognition.
Ibrahim H, Loo CK, Alnajjar F. Neural Comput Appl. 2022;34(20):17581-17599. doi: 10.1007/s00521-022-07410-2. Epub 2022 May 31. PMID: 35669535 Free PMC article.
Survey on Deep Neural Networks in Speech and Vision Systems.
Alam M, Samad MD, Vidyaratne L, Glandon A, Iftekharuddin KM. Neurocomputing (Amst). 2020 Dec 5;417:302-321. doi: 10.1016/j.neucom.2020.07.053. Epub 2020 Jul 26. PMID: 33100581 Free PMC article.
