Accelerating image convolution filtering algorithms on integrated CPU-GPU architectures

Zhou, Yi; He, Fazhi<sup>*</sup>; Qiu, Yimin

doi:10.1117/1.JEI.27.3.033002

摘要

Convolution filtering is one of the most important algorithms in image processing. It is data-intensive, especially when dealing with high-definition images. Most previous studies on accelerating convolution computation in parallel focus on the use of graphics processing units (GPUs), whereas the central processing units (CPUs) always play the role of host to manage the data buffer and control flow. However, recent CPU architectures have seen significant modifications to parallel data computing capabilities, and the trend of integrating the CPU and GPU on a single chip is on a rise. We propose an approach to accelerate convolution filtering on the heterogeneous architecture of integrated CPU-GPU. We exploit the parallel processing power of vector instructions on a CPU and make it collaboratively function with the on-chip GPU. Two task assignment methods, static and dynamic task partitioning, are proposed for CPU-GPU collaboration. We evaluate our approach with images and filters of different sizes. The experimental results demonstrate that we can achieve 146 GFLOP/s at best using a quad-core CPU and the performance is 2.5 to 4.8 times faster than that of the single-GPU version of the OpenCV library. We also obtain 90 times speedup over the single-threaded CPU version. The results demonstrate that the proposed algorithm is efficient.

出版日期2018-5
单位武汉大学; 武汉科技大学

全文

访问全文

收藏分享被引(7) 浏览

更新时间：2024-04-13 03:04

Accelerating image convolution filtering algorithms on integrated CPU-GPU architectures

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友