摘要

The cognitive performance-based dimensional emotion recognition in whispered speech is studied. First, the whispered speech emotion databases and data collection methods are compared, and the character of emotion expression in whispered speech is studied, especially the basic types of emotions. Secondly, the emotion features for whispered speech is analyzed, and by reviewing the latest references, the related valence features and the arousal features are provided. The effectiveness of valence and arousal features in whispered speech emotion classification is studied. Finally, the Gaussian mixture model is studied and applied to whispered speech emotion recognition. The cognitive performance is also considered in emotion recognition so that the recognition errors of whispered speech emotion can be corrected. Based on the cognitive scores, the emotion recognition results can be improved. The results show that the formant features are not significantly related to arousal dimension, while the short-term energy features are related to the emotion changes in arousal dimension. Using the cognitive scores, the recognition results can be improved.

  • 出版日期2015

全文