摘要

This paper presents a novel local image descriptor that is robust to general image deformations, and its application to street landmark localization. A limitation with traditional image descriptors is that they use a single support region for each interest point. For general image deformations, the amount of deformation for each location varies and is unpredictable such that it is difficult to choose the best scale of the support region. To overcome this difficulty, we propose to use multiple support regions (MSRs) of different sizes surrounding an interest point. A feature vector is computed for each support region, and the concatenation of these feature vectors forms the descriptor for this interest point. Furthermore, we propose a new similarity measure model, a local-to-global similarity (LGS) model, for point matching that takes advantage of the multi-size support regions. Each support region acts as a 'weak' classifier and the weights of these classifiers are learned in an unsupervised manner. Based on LGS model, we propose a MSR oriented efficient subimage retrieval (MSR-ESR) for object localization. The proposed approach is evaluated on a number of images with real and synthetic deformations, and also 15 US street landmarks' images and videos. The experiment results show that our method outperforms existing techniques under different deformations.