A translation framework for executing the sequential binary code on CPU/GPU based architectures

Zhu, Erzhou; Guan, Haibing<sup>*</sup>; Dong, Guoxing; Yang, Yindong; Yang, Hongbo

doi:10.4304/jsw.6.12.2331-2340

摘要

The method of using DBT (dynamic binary translation) to execute the source ISAs binary code on target platforms has been perplexed by low overhead for many years. GPU as a many-core processor has tremendous computational power. Employing GPU as a coprocessor to parallel execute the hot spot of binary code hold a great promise of substantially reduce the overhead of DBT. This paper presents a novel translation framework for constructing the virtual execution environment aiming at accelerating the process of DBT on CPU/GPU based architectures. With parallelizable parts (hot spots) of binary code and their related information, the framework converts the sequential code into PTX form and executes them on GPUs. Under the framework, we need not to rewrite the source code, and the binary compatibility issues between different GPUs are also resolved properly. Experimental results on several programs from CUDA SDK Code Samples and Parboil Benchmark Suite show that the framework can significantly improve the performance, usually have 10X speedup on average compared to X86 native platforms. Especially, when the scale of input become larger, the performance becomes even better.

出版日期2011
单位上海交通大学

全文

访问全文

收藏分享被引浏览

更新时间：2019-09-22 16:48

A translation framework for executing the sequential binary code on CPU/GPU based architectures

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友