Augmented saliency model using automatic 3D head pose detection and learned gaze following in natural scenes

Parks Daniel<sup>*</sup>; Borji Ali; Itti Laurent

doi:10.1016/j.visres.2014.10.027

摘要

Previous studies have shown that gaze direction of actors in a scene influences eye movements of passive observers during free-viewing (Castelhano, Wieth, & Henderson, 2007; Borji, Parks, & Itti, 2014). However, no computational model has been proposed to combine bottom-up saliency with actor's head pose and gaze direction for predicting where observers look. Here, we first learn probability maps that predict fixations leaving head regions (gaze following fixations), as well as fixations on head regions (head fixations), both dependent on the actor's head size and pose angle. We then learn a combination of gaze following, head region, and bottom-up saliency maps with a Markov chain composed of head region and non-head region states. This simple structure allows us to inspect the model and make comments about the nature of eye movements originating from heads as opposed to other regions. Here, we assume perfect knowledge of actor head pose direction (from an oracle). The combined model, which we call the Dynamic Weighting of Cues model (DWOC), explains observers' fixations significantly better than each of the constituent components. Finally, in a fully automatic combined model, we replace the oracle head pose direction data with detections from a computer vision model of head pose. Using these (imperfect) automated detections, we again find that the combined model significantly outperforms its individual components. Our work extends the engineering and scientific applications of saliency models and helps better understand mechanisms of visual attention.

出版日期2015-11

全文

访问全文

收藏分享被引(21) 浏览

更新时间：2021-12-28 13:46

Augmented saliency model using automatic 3D head pose detection and learned gaze following in natural scenes

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友