A Performance and Energy Comparison of Convolution on GPUs, FPGAs, and Multicore Processors

作者:Fowers Jeremy*; Brown Greg; Wernsing John; Stitt Greg
来源:ACM Transactions on Architecture and Code Optimization, 2013, 9(4): 25.
DOI:10.1145/2400682.2400684

摘要

Recent architectural trends have focused on increased parallelism via multicore processors and increased heterogeneity via accelerator devices (e.g., graphics-processing units, field-programmable gate arrays). Although these architectures have significant performance and energy potential, application designers face many device-specific challenges when choosing an appropriate accelerator or when customizing an algorithm for an accelerator. To help address this problem, in this article we thoroughly evaluate convolution, one of the most common operations in digital-signal processing, on multicores, graphics-processing units, and field-programmable gate arrays. Whereas many previous application studies evaluate a specific usage of an application, this article assists designers with design space exploration for numerous use cases by analyzing effects of different input sizes, different algorithms, and different devices, while also determining Pareto-optimal trade-offs between performance and energy.

  • 出版日期2013-1

全文