A Configurable Parallel Hardware Architecture for Efficient Integral Histogram Image Computing

作者:Yin, Shouyi*; Ouyang, Peng*; Chen, Tianbao*; Liu, Leibo*; Wei, Shaojun*
来源:IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2016, 24(4): 1305-1318.
DOI:10.1109/TVLSI.2015.2462752

摘要

Integral histogram image can accelerate the computing process of feature algorithm in computer vision, but exhibits high computation complexity and inefficient memory access. In this paper, we propose a configurable parallel architecture to improve the computing efficiency of integral histogram. Based on the configurable design in the architecture, multiple integral objects for integral histogram image, such as image intensity, image gradient, and local binary pattern, are well supported. Meanwhile, by means of the proposed strip-based memory partitioning mechanism, this architecture processes the integral histogram quickly with maximal parallelism in a pipeline manner. Besides, in this architecture, the proposed data correlation memory compression mechanism effectively solves the expansion problem of integral histogram memory caused by storing the histogram data. It fully reduces the data redundancy in the integral histograms, and saves a lot of memory resources. Experiments using Cyclone IV-based field-programmable gate array platform and 65-nm technology-based postsynthesis show that our architecture improves the average computing speed by 8.6 times with high power efficiency compared with the state-of-the-art works.