Abstract

Automatic emotion recognition is a challenging task with a wide range of applications. This paper addresses emotion recognition under multi-cultural conditions. Different multimodal features are extracted from the audio and visual modalities, and recognition performance is compared between hand-crafted features and features automatically learned by deep neural networks. Multimodal feature fusion is also explored to combine the two modalities. The CHEAVD Chinese multimodal emotion dataset and the AFEW English multimodal emotion dataset are used to evaluate the proposed methods. The importance of the culture factor is first demonstrated through cross-culture emotion recognition experiments. Three strategies are then developed to improve recognition performance in the multi-cultural environment: selecting a culture-specific emotion model for each culture, jointly training with multi-cultural datasets, and embedding features from multi-cultural datasets into a shared emotion space. The embedding strategy separates the cultural influence from the original features and generates more discriminative emotion features, yielding the best performance for both acoustic and multimodal emotion recognition.

Full text