Recognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization

Redondo Cabrera Carolina<sup>*</sup>; Lopez Sastre Roberto J; Acevedo Rodriguez Javier; Maldonado Bascon Saturnino

doi:10.1016/j.imavis.2014.08.007

摘要

This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a working volume into fine sub-volumes, and computing a hierarchical weighted sum of histogram intersections of visual words at each level of the 3D pyramid structure. With the aim of increasing both the classification accuracy and the computational efficiency of the kernel, we propose two selective hierarchical volume decomposition strategies, based on representative and discriminative sub-volume selection processes, which drastically reduce the pyramid to consider. Results on different RGBD datasets show that our approaches obtain state-of-the-art results for both object recognition and scene categorization.

出版日期2014-12

全文

访问全文

收藏分享被引(3) 浏览

更新时间：2019-02-21 03:05

Recognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友