Abstract

In this paper, an eleven-layer Convolutional Neural Network with Visual Attention is proposed for facial expression recognition. The network consists of three components. First, local convolutional features of faces are extracted by a stack of ten convolutional layers. Second, regions of interest are determined automatically from these local features by an embedded attention model. Third, the local features within these regions are aggregated and used to infer the emotion label. The three components are integrated into a single network that can be trained end to end. Extensive experiments on four kinds of data (namely aligned frontal faces, faces in different poses, aligned unconstrained faces, and grouped unconstrained faces) demonstrate that the proposed method improves recognition accuracy and yields interpretable visualizations. The visualizations show that the learned regions of interest partly coincide with the locations of emotion-specific Action Units. This finding supports the interpretation of the Facial Action Coding System and the Emotional Facial Action Coding System from a machine learning perspective.
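The three-component pipeline described above (convolutional feature extraction, attention over spatial regions, and aggregation for classification) can be sketched in NumPy. This is a minimal illustration under assumed shapes and random weights, not the paper's actual implementation; the feature-map size, attention parameterization, and classifier are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Component 1 (assumed output): a conv stack would produce an (H, W, C)
# spatial feature map; here we stand in random features for illustration.
H, W, C = 7, 7, 64
features = rng.standard_normal((H, W, C))

# Component 2 (sketch): a learned projection scores each spatial location;
# a softmax over locations gives attention weights marking regions of interest.
w_att = rng.standard_normal(C)
scores = features.reshape(-1, C) @ w_att       # (H*W,) location scores
weights = np.exp(scores - scores.max())
weights /= weights.sum()                       # softmax over locations

# Component 3: aggregate local features, weighted by attention, into a
# single descriptor, then infer the emotion label with a linear classifier.
pooled = weights @ features.reshape(-1, C)     # (C,) attended descriptor
n_classes = 7                                  # e.g. the basic emotions
w_cls = rng.standard_normal((C, n_classes))
logits = pooled @ w_cls
label = int(np.argmax(logits))
print(pooled.shape, label)
```

In the actual network all three stages share gradients, so the attention weights are learned jointly with the convolutional features and the classifier in one end-to-end training scheme.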