摘要

Motivated by the successful application of Local Binary Pattern (LBP), in this paper we propose a novel pixel neighborhood differential statistic feature for pedestrian and face detection based on the multiple channel maps. The calculation of LBP comprises of two steps, Pixel Differential Feature (PDF) calculation and PDF sign encoding. The PDF distills discriminative information of local region that can improve the performance of the pedestrian detector, but the encoding step degrades the performance due to the quantization error. Although PDF is more discriminative than original channel maps, it has a much higher dimension than the original feature maps, and consequently requiring large computation cost. To address this issue, the pixel neighborhood differential pattern is learned with both supervised and unsupervised learning methods, which allow discovering discriminative pixel differential patterns in local area and achieving state-of-the-art results. Specifically, our method firstly aggregates the image channel maps into cell maps with max pooling. Then, pixel neighborhood differential feature based on each channel cell maps are calculated which contributes to encoding discriminative information in each local area and benefits the performance improvements. In addition, we attempt to learn discriminative differential statistic patterns by using linear discriminative analysis (LDA) and principle component analysis (PCA) for further performance improvement. Two sets of experiments are conducted on pedestrian detection and face detection respectively. The INRIA, Caltech, and ETH datasets are used for pedestrian detection, and the FDDB and AFW datasets for multi-view face detection. The experimental results show that our method achieves superior performance in comparison with the state-of-the-arts while running at 20 fps for 480x640 images.