Review

A Comprehensive Review of Multimodal Emotion Recognition: Techniques, Challenges, and Future Directions

You Wu et al. Biomimetics (Basel). 2025 Jun 27;10(7):418. doi: 10.3390/biomimetics10070418.

Abstract

This paper presents a comprehensive review of multimodal emotion recognition (MER), the process of integrating multiple data modalities, such as speech, visual, and textual signals, to identify human emotions. Grounded in biomimetics, the survey frames MER as a bio-inspired sensing paradigm that emulates the way humans seamlessly fuse multisensory cues to communicate affect, thereby transferring principles from living systems to engineered solutions. By leveraging multiple modalities, MER systems offer a richer and more robust analysis of emotional states than unimodal approaches. The review covers the general structure of MER systems, feature extraction techniques, and multimodal information fusion strategies, highlighting key advancements and milestones. It also addresses open research challenges in MER, including lightweight models, cross-corpus generalizability, and the incorporation of additional modalities. The paper concludes by discussing future directions aimed at improving the accuracy, explainability, and practicality of MER systems for real-world applications.
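To make the fusion idea concrete, below is a minimal sketch of feature-level (concatenation) fusion across speech, visual, and text embeddings, written in PyTorch. All module names, feature dimensions, and the seven-class output are illustrative assumptions, not the architecture of any system surveyed in the review.

import torch
import torch.nn as nn

class FeatureFusionMER(nn.Module):
    """Feature-level (concatenation) fusion over three modality embeddings."""
    def __init__(self, speech_dim=128, visual_dim=256, text_dim=768, n_emotions=7):
        super().__init__()
        # Per-modality projections; stand-ins for real feature extractors
        # (e.g., an acoustic encoder, a face encoder, a text encoder).
        self.speech_proj = nn.Linear(speech_dim, 64)
        self.visual_proj = nn.Linear(visual_dim, 64)
        self.text_proj = nn.Linear(text_dim, 64)
        # Shared classifier over the concatenated (fused) representation.
        self.classifier = nn.Sequential(nn.ReLU(), nn.Linear(64 * 3, n_emotions))

    def forward(self, speech, visual, text):
        # Fuse by concatenating the projected modality features.
        fused = torch.cat(
            [self.speech_proj(speech), self.visual_proj(visual), self.text_proj(text)],
            dim=-1,
        )
        return self.classifier(fused)

# Usage with random stand-in features for a batch of 4 samples.
model = FeatureFusionMER()
logits = model(torch.randn(4, 128), torch.randn(4, 256), torch.randn(4, 768))
print(logits.shape)  # torch.Size([4, 7])

Decision-level (late) fusion, by contrast, would classify each modality separately and combine the per-modality predictions, for example by averaging their logits.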

Keywords: emotion analysis; feature extraction; information fusion; multimodal emotion recognition.


Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1. PRISMA flow diagram.
Figure 2. Record counts at each PRISMA stage.
Figure 3. Distribution of modality types in 103 MER studies.
Figure 4. Annual trend of modality adoption, 2011–2025.
Figure 5. Word cloud of modality keyword frequencies, 2011–2025.
Figure 6. Prevalence of modalities by lead author country.
Figure 7. The workflow of the MER system.


