Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition

Deng Jun*; Zhang Zixing; Eyben Florian; Schuller Bjoern

doi:10.1109/LSP.2014.2324759

Abstract

With speech data now obtained from different devices and under varied acquisition conditions, we often face scenarios where the intrinsic discrepancy between the training and test data has an adverse impact on affective speech analysis. To address this issue, this letter introduces an Adaptive Denoising Autoencoder based on an unsupervised domain adaptation method, where prior knowledge learned from a target set is used to regularize the training on a source set. Our goal is to achieve a matched feature space representation for the target and source sets while ensuring target domain knowledge transfer. The method has been successfully evaluated on the 2009 INTERSPEECH Emotion Challenge's FAU Aibo Emotion Corpus as the target corpus and two other publicly available speech emotion corpora as sources. The experimental results show that our method significantly improves over the baseline performance and outperforms related feature domain adaptation methods.
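The abstract says only that knowledge learned on the target set regularizes training on the source set; it does not give the objective. As a rough illustration, the sketch below assumes one plausible form: a denoising autoencoder's reconstruction error on source data plus a squared penalty that pulls the source weights toward weights previously learned on target data. The function names, the tied-weight linear decoder, the masking noise, and the hyperparameters `beta` and `p` are all assumptions made for this example, not details taken from the letter.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def encode(W, b, x):
    # Hidden representation: sigmoid(W x + b)
    return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + bj)
            for row, bj in zip(W, b)]


def decode(W, c, h):
    # Assumed tied weights: linear reconstruction W^T h + c
    n_in = len(W[0])
    return [sum(W[j][i] * h[j] for j in range(len(W))) + c[i]
            for i in range(n_in)]


def corrupt(x, p, rng):
    # Masking noise: each input feature is zeroed with probability p
    return [0.0 if rng.random() < p else xi for xi in x]


def adaptive_dae_loss(W, b, c, W_target, source_batch, beta, p, rng):
    """Mean reconstruction error on source data plus a penalty that
    keeps W close to W_target (weights assumed pre-trained on target data)."""
    recon = 0.0
    for x in source_batch:
        x_hat = decode(W, c, encode(W, b, corrupt(x, p, rng)))
        # The clean input x is the reconstruction target
        recon += sum((a - d) ** 2 for a, d in zip(x_hat, x))
    recon /= len(source_batch)
    penalty = sum((w - wt) ** 2
                  for row, row_t in zip(W, W_target)
                  for w, wt in zip(row, row_t))
    return recon + beta * penalty
```

In this sketch, `beta = 0` reduces the objective to an ordinary denoising autoencoder trained on source data alone, while larger values tie the source representation more tightly to the target-trained weights.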

Publication date: 2014-09


