A bag-of-regions representation for video classification

Choi Min Kook; Wang Ziyu; Lee Hyun Gyu; Lee Sang Chul<sup>*</sup>

doi:10.1007/s11042-015-2876-y

摘要

A bag-of-regions (BoR) representation of a video sequence is a spatio-temporal tessellation for use in high-level applications such as video classifications and action recognitions. We obtain a BoR representation of a video sequence by extracting regions that exist in the majority of its frames and largely correspond to a single object. First, the significant regions are obtained using unsupervised frame segmentation based on the JSEG method. A tracking algorithm for splitting and merging the regions is then used to generate a relational graph of all regions in the segmented sequence. Finally, we perform a connectivity analysis on this graph to select the most significant regions, which are then used to create a high-level representation of the video sequence. We evaluated our representation using a SVM classifier for the video classification and achieved about 85 % average precision using the UCF50 dataset.

出版日期2016-3

全文

访问全文

收藏分享被引(1) 浏览

更新时间：2021-04-11 10:32

A bag-of-regions representation for video classification

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友