Sensors (Basel). 2022 Mar 23;22(7):2461.
doi: 10.3390/s22072461.

The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning

Giovanni Costantini et al. Sensors (Basel).

Abstract

Machine learning (ML) algorithms within a human-computer framework are the leading force in speech emotion recognition (SER). However, few studies explore cross-corpora aspects of SER; this work explores the feasibility and characteristics of cross-linguistic, cross-gender SER. Three ML classifiers (SVM, Naïve Bayes and MLP) are applied to acoustic features obtained through a procedure based on Kononenko's discretization and correlation-based feature selection. The system covers five emotions (disgust, fear, happiness, anger and sadness) and uses the Emofilm database, which comprises short clips of English movies and their Italian and Spanish dubbed versions, for a total of 1115 annotated utterances. MLP is the most effective classifier, with accuracies higher than 90% for single-language approaches, while the cross-language classifier still yields accuracies higher than 80%. Cross-gender tasks prove more difficult than those involving two languages, suggesting greater differences between emotions expressed by male versus female subjects than between different languages. Four feature domains, namely RASTA, F0, MFCC and spectral energy, are algorithmically assessed as the most effective, refining existing literature and approaches based on standard sets. To our knowledge, this is one of the first studies encompassing cross-gender and cross-linguistic assessments of SER.

Keywords: English; SER; SVM; artificial intelligence; cross-gender; cross-linguistic; emotion recognition; machine learning; speech.
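
The correlation-based feature selection step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the standard CFS merit heuristic, k·r_cf / sqrt(k + k(k−1)·r_ff), uses absolute Pearson correlation as a stand-in for the correlation measure (CFS on discretized features usually relies on an information-based measure instead), and runs on invented toy data. The feature names `f0_mean`, `energy` and `noise`, and the binary labels, are hypothetical.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def cfs_merit(subset, features, labels):
    """CFS merit: rewards feature-class correlation, penalizes feature-feature redundancy."""
    k = len(subset)
    if k == 0:
        return 0.0
    r_cf = mean([abs(pearson(features[f], labels)) for f in subset])
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = mean([abs(pearson(features[a], features[b])) for a, b in pairs]) if pairs else 0.0
    return k * r_cf / sqrt(k + k * (k - 1) * r_ff)


def greedy_cfs(features, labels):
    """Forward selection: keep adding the feature that most improves the merit."""
    selected, best = [], 0.0
    remaining = sorted(features)
    while remaining:
        merit, name = max((cfs_merit(selected + [f], features, labels), f) for f in remaining)
        if merit <= best:  # no candidate improves the subset; stop
            break
        selected.append(name)
        remaining.remove(name)
        best = merit
    return selected, best


# Hypothetical toy data: six utterances, binary emotion label (assumption, not from the paper).
labels = [0, 0, 0, 1, 1, 1]
features = {
    "f0_mean": [1, 2, 1, 5, 6, 5],  # informative
    "energy": [1, 1, 2, 5, 5, 6],   # informative, partly complementary
    "noise": [3, 1, 2, 3, 1, 2],    # uncorrelated with the label
}
selected, merit = greedy_cfs(features, labels)
print(selected, round(merit, 3))
```

On this toy data the uninformative feature is left out because adding it lowers the merit of the subset, which is the behaviour the heuristic is designed to produce.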

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Number of speakers that uttered a certain number of clips (a) Female; (b) Male. As an example, 17 females uttered one clip, 20 females uttered 2 clips and, finally, one female uttered 25 clips (last point on the x-axis).
Figure 2
Flowchart for the SER Machine Learning framework.
Figure 3
Confusion matrices for the SVM classifier for the It M, It F, All M and All F comparisons. Emotion labels are thus abbreviated: “DIS” = Disgust; “HAP” = Happy; “FEAR” = Fear; “ANG” = Angry; “SAD” = Sad.
