摘要

Multiple global memory access may lead to serious bottlenecks in GPU (Graphic Processing Unit) kernels. Global memory access congestion brings low throughput as well as bad performance. In the paper, the crucial characteristics of global memory access are analysed. Then a global memory access congestion judging model based on grey clustering is proposed, which can make classification for the congestion degree of global memory access. After analyzing the congestion objects and choosing the access data, optimization is carried out by a grey target decision model based on cobweb area. So the congestion is relieved. The proposed model is evaluated with several benchmarks on NVIDIA GTX 750. Comparing with the original kernels, experimental results demonstrate that the model can achieve 11.09% improvement of global memory throughput averagely.