Action recognition by hidden temporal models

Wu, Jianzhai; Hu, Dewen<sup>*</sup>; Chen, Fanglin

doi:10.1007/s00371-013-0899-9

摘要

We focus on the recognition of human actions in uncontrolled videos that may contain complex temporal structures. It is a difficult problem because of the large intra-class variations in viewpoint, video length, motion pattern, etc. To address these difficulties, we propose a novel system in this paper that represents each action class by hidden temporal models. In this system, we represent the crucial action event per category by a video segment that covers a fixed number of frames and can move temporally within the sequences. To capture the temporal structures, the video segment is described by a temporal pyramid model. To capture large intra-class variations, multiple models are combined using Or operation to represent alternative structures. The index ofmodel and the start frame of segment are both treated as hidden variables. We implement a learning procedure based on the latent SVM method. The proposed approach is tested on two difficult benchmarks: the Olympic Sports and HMDB51 data sets. The experimental results reveal that our system is comparable to the state-of-the-art methods in the literature.

出版日期2014-12
单位中国人民解放军国防科学技术大学

全文

访问全文

收藏分享被引(9) 浏览

更新时间：2020-02-09 01:31

Action recognition by hidden temporal models

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友