Data-centric Combinatorial Optimization of Parallel Code

作者:Luo Hao*; Chen Guoyang; Li Pengcheng; Ding Chen; Shen Xipeng
来源:ACM Sigplan Notices, 2016, 51(8): 379-380.
DOI:10.1145/2851141.2851182

摘要

Memory performance is one essential factor for tapping into the full potential of the massive parallelism of GPU. It has motivated some recent efforts in GPU cache modeling. This paper presents a new data-centric way to model the performance of a system with heterogeneous memory resources. The new model is composable, meaning it can predict the performance difference due to placing data differently by profiling the execution just once.

  • 出版日期2016-8

全文