A parallel scheme for accelerating parameter sweep applications on a GPU

作者:Ino Fumihiko*; Shigeoka Kentaro; Okuyama Tomohiro; Motokubota Masaya; Hagihara Kenichi
来源:Concurrency and Computation-Practice & Experience, 2014, 26(2): 516-531.
DOI:10.1002/cpe.3016

摘要

This paper proposes a parallel scheme for accelerating parameter sweep applications on a graphics processing unit. By using hundreds of cores on the graphics processing unit, we found that our scheme simultaneously processes multiple parameters rather than a single parameter. The simultaneous sweeps exploit the similarity of computing behaviors shared by different parameters, thus allowing memory accesses to be coalesced into a single access if similar irregularities appear among the parameters' computational tasks. In addition, our scheme reduces the amount of off-chip memory access by unifying the data that are commonly referenced by multiple parameters and by placing the unified data in the fast on-chip memory. In several experiments, we applied our scheme to practical applications and found that our scheme can perform up to 8.5times faster than a naive scheme that processes a single parameter at a time. We also include a discussion on application characteristics that are required for our scheme to outperform the naive scheme.

  • 出版日期2014-2

全文