Embedding Visual Hierarchy With Deep Networks for Large-Scale Visual Recognition

Zhao, Tianyi; Zhang, Baopeng; He, Ming; Zhang, Wei; Zhou, Ning; Yu, Jun; Fan, Jianping<sup>*</sup>

doi:10.1109/TIP.2018.2845118

摘要

In this paper, a layer-wise mixture model (LMM) is developed to support hierarchical visual recognition, where a Bayesian approach is used to automatically adapt the visual hierarchy to the progressive improvements of the deep network along the time. Our LMM algorithm can provide an end-to-end approach for jointly learning: 1) the deep network for achieving more discriminative deep representations for object classes and their inter-class visual similarities; 2) the tree classifier for recognizing large numbers of object classes hierarchically; and 3) the visual hierarchy adaptation for achieving more accurate assignment and organization of large numbers of object classes. By learning the tree classifier, the deep network and the visual hierarchy adaptation jointly in an end-to-end manner, our LMM algorithm can achieve higher accuracy rates on hierarchical visual recognition. Our experiments are carried on ImageNet1K and ImageNet10K image sets, which have demonstrated that our LMM algorithm can achieve very competitive results on the accuracy rates as compared with the baseline methods.

出版日期2018-10
单位北京交通大学; 复旦大学; 杭州电子科技大学

全文

访问全文

收藏分享被引(27) 浏览

更新时间：2024-05-11 08:45

Embedding Visual Hierarchy With Deep Networks for Large-Scale Visual Recognition

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友