Hybrid generative-discriminative human action recognition by combining spatiotemporal words with supervised topic models

Sun, Hao<sup>*</sup>; Wang, Cheng; Wang, Boliang

doi:10.1117/1.3537969

摘要

We present a hybrid generative-discriminative learning method for human action recognition from video sequences. Our model combines a bag-of-words component with supervised latent topic models. A video sequence is represented as a collection of spatiotemporal words by extracting space-time interest points and describing these points using both shape and motion cues. The supervised latent Dirichlet allocation (sLDA) topic model, which employs discriminative learning using labeled data under a generative framework, is introduced to discover the latent topic structure that is most relevant to action categorization. The proposed algorithm retains most of the desirable properties of generative learning while increasing the classification performance though a discriminative setting. It has also been extended to exploit both labeled data and unlabeled data to learn human actions under a unified framework. We test our algorithm on three challenging data sets: the KTH human motion data set, the Weizmann human action data set, and a ballet data set. Our results are either comparable to or significantly better than previously published results on these data sets and reflect the promise of hybrid generative-discriminative learning approaches.

出版日期2011-2
单位中国人民解放军国防科学技术大学; 厦门大学

全文

访问全文

收藏分享被引(1) 浏览

更新时间：2019-08-15 15:34

Hybrid generative-discriminative human action recognition by combining spatiotemporal words with supervised topic models

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友