Accelerating Divergent Applications on SIMD Architectures Using Neural Networks

Authors: Beayna Grigorian*; Glenn Reinman
Source: ACM Transactions on Architecture and Code Optimization, 2015, 12(1): Article 2.
DOI:10.1145/2717311

Abstract

The purpose of this research is to find a neural-network-based solution to the well-known problem of branch divergence in Single Instruction Multiple Data (SIMD) architectures. Our approach differs from existing techniques for handling branch (or control-flow) divergence, which rely on costly hardware modifications, low-utilization masking techniques, or static prediction methods. As we examine divergent applications, we characterize the degree of data-dependent control flow seen in each and isolate the code regions (or "kernels") that cause the most performance degradation due to branch divergence. We then train neural networks (NNs) offline to approximate these kernels and inject the NN computations directly into the applications as substitutes for the kernels they approximate. This essentially translates control flow into nondivergent computation, trading precision for performance. Since our methodology manipulates application source code directly, it is inherently platform agnostic and can be adopted as a general means of accelerating divergent applications on data-parallel architectures. In this article, we present the Neuralizer, an automated software flow for kernel identification, NN training, and NN integration, as well as supplementary user-controlled optimization techniques. Evaluating our approach on a variety of divergent applications run on a Graphics Processing Unit (GPU), we achieve average performance gains of 13.6× and energy savings of 14.8× with 96% accuracy.
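To make the core idea concrete, the following is a minimal, hypothetical CUDA sketch, not the authors' Neuralizer tool or their benchmark kernels: a kernel with data-dependent branches next to a branch-free two-layer MLP substitute. The kernel bodies, the hidden width H, and the placeholder weights are all illustrative assumptions; in the paper's flow, the weights would come from offline NN training against the original kernel's input/output behavior.

```cuda
// Hypothetical sketch: replacing a divergent kernel with a branch-free
// NN approximation. Weights and sizes are illustrative placeholders.
#include <cuda_runtime.h>
#include <cstdlib>

#define H 8  // hidden-layer width (assumed; determined offline in practice)

// Original kernel: the data-dependent branch serializes SIMD execution
// whenever threads in a warp take different paths.
__global__ void divergent_kernel(const float* in, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float x = in[i];
    if (x > 0.5f)      out[i] = sqrtf(x) * 2.0f;   // path A
    else if (x > 0.0f) out[i] = x * x;             // path B
    else               out[i] = -x;                // path C
}

// Offline-trained parameters of a tiny one-input MLP (placeholders here).
__constant__ float W1[H], b1[H], W2[H], b2;

// NN substitute: straight-line code, identical for every thread, so a
// warp never diverges; precision is traded for uniform control flow.
__global__ void nn_kernel(const float* in, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float x = in[i], acc = b2;
    #pragma unroll
    for (int h = 0; h < H; ++h)
        acc += W2[h] * tanhf(W1[h] * x + b1[h]);  // tanh hidden units
    out[i] = acc;  // approximate, branch-free result
}

int main() {
    const int n = 1 << 20;
    // Placeholder weights; the paper's flow would train these offline.
    float hW[H], hb[H], hV[H], hb2 = 0.0f;
    for (int h = 0; h < H; ++h) { hW[h] = 1.0f; hb[h] = 0.0f; hV[h] = 0.1f; }
    cudaMemcpyToSymbol(W1, hW, sizeof(hW));
    cudaMemcpyToSymbol(b1, hb, sizeof(hb));
    cudaMemcpyToSymbol(W2, hV, sizeof(hV));
    cudaMemcpyToSymbol(b2, &hb2, sizeof(hb2));

    float *d_in, *d_out;
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_out, n * sizeof(float));
    float* h_in = (float*)malloc(n * sizeof(float));
    for (int i = 0; i < n; ++i) h_in[i] = (i % 100) / 50.0f - 1.0f;  // in [-1, 1)
    cudaMemcpy(d_in, h_in, n * sizeof(float), cudaMemcpyHostToDevice);

    divergent_kernel<<<(n + 255) / 256, 256>>>(d_in, d_out, n);  // exact
    nn_kernel<<<(n + 255) / 256, 256>>>(d_in, d_out, n);         // approximate
    cudaDeviceSynchronize();

    cudaFree(d_in); cudaFree(d_out); free(h_in);
    return 0;
}
```

Because every thread in nn_kernel executes the same instruction sequence, warps never serialize on branches; the accuracy/performance tradeoff is then governed entirely by how well the trained network approximates the original kernel, which is the tradeoff the abstract quantifies (13.6× speedup at 96% accuracy).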

  • Publication date: April 2015