Action Recognition from Video Using Feature Covariance Matrices

Guo Kai<sup>*</sup>; Ishwar Prakash; Konrad Janusz

doi:10.1109/TIP.2013.2252622

摘要

We propose a general framework for fast and accurate recognition of actions in video using empirical covariance matrices of features. A dense set of spatio-temporal feature vectors are computed from video to provide a localized description of the action, and subsequently aggregated in an empirical covariance matrix to compactly represent the action. Two supervised learning methods for action recognition are developed using feature covariance matrices. Common to both methods is the transformation of the classification problem in the closed convex cone of covariance matrices into an equivalent problem in the vector space of symmetric matrices via the matrix logarithm. The first method applies nearest-neighbor classification using a suitable Riemannian metric for covariance matrices. The second method approximates the logarithm of a query covariance matrix by a sparse linear combination of the logarithms of training covariance matrices. The action label is then determined from the sparse coefficients. Both methods achieve state-of-the-art classification performance on several datasets, and are robust to action variability, viewpoint changes, and low object resolution. The proposed framework is conceptually simple and has low storage and computational requirements making it attractive for real-time implementation.

出版日期2013-6

全文

访问全文

收藏分享被引(96) 浏览

更新时间：2024-04-16 05:40

Action Recognition from Video Using Feature Covariance Matrices

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友