A uniform approach for programming distributed heterogeneous computing systems

Authors: Grasso Ivan*; Pellegrini Simone; Cosenza Biagio; Fahringer Thomas
Source: Journal of Parallel and Distributed Computing, 2014, 74(12): 3228-3239.
DOI:10.1016/j.jpdc.2014.08.002

Abstract

Large-scale compute clusters of heterogeneous nodes equipped with multi-core CPUs and GPUs are becoming increasingly popular in the scientific community. However, such systems require a combination of different programming paradigms, which makes application development very challenging.

In this article we introduce libWater, a library-based extension of the OpenCL programming model that simplifies the development of heterogeneous distributed applications. libWater consists of a simple interface, which is a transparent abstraction of the underlying distributed architecture and offers advanced features such as inter-context and inter-node device synchronization. It provides a runtime system that tracks the dependency information enforced by event synchronization to dynamically build a DAG of commands, to which two optimizations are applied automatically: collective communication pattern detection and device-host-device copy removal.

We assess libWater's performance on three compute clusters available from the Vienna Scientific Cluster, the Barcelona Supercomputing Center and the University of Innsbruck, demonstrating improved performance and scaling with different test applications and configurations.
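
The runtime idea described in the abstract (building a DAG of commands from event dependencies, then removing device-host-device copies) can be illustrated with a toy sketch. This is not libWater's API or implementation: the `Command` class, the `build_dag` and `remove_dhd_copies` functions, and the buffer names are all hypothetical, and the fusion rule shown (a device-to-host copy followed by a host-to-device copy of the same otherwise unused host buffer becomes one device-to-device copy) is an assumption about what such an optimization might look like.

```python
from dataclasses import dataclass, field


@dataclass
class Command:
    # Hypothetical command record; not part of libWater or OpenCL.
    name: str
    kind: str                 # "kernel", "d2h", "h2d" or "d2d"
    src: str = ""             # buffer read by the command
    dst: str = ""             # buffer written by the command
    wait_for: list = field(default_factory=list)  # names of awaited commands


def build_dag(commands):
    """Map each command name to the names of the commands it depends on."""
    return {c.name: list(c.wait_for) for c in commands}


def remove_dhd_copies(commands):
    """Fuse d2h + h2d copy pairs through a private host buffer into one d2d copy."""
    by_name = {c.name: c for c in commands}
    replaced = {}   # old command name -> name of the fused command
    fused_at = {}   # h2d command name -> fused command that takes its place
    for c in commands:
        if c.kind != "h2d":
            continue
        for dep in c.wait_for:
            p = by_name.get(dep)
            if p is None or p.kind != "d2h" or p.dst != c.src:
                continue
            # Only fuse when no other command touches the host buffer.
            users = [x for x in commands if c.src in (x.src, x.dst)]
            if len(users) != 2:
                continue
            fused_name = f"{p.name}+{c.name}"
            waits = p.wait_for + [w for w in c.wait_for if w != p.name]
            fused_at[c.name] = Command(fused_name, "d2d", p.src, c.dst, waits)
            replaced[p.name] = fused_name
            replaced[c.name] = fused_name
            break
    out = []
    for c in commands:
        if c.name in fused_at:
            out.append(fused_at[c.name])
        elif c.name not in replaced:
            out.append(c)
    # Redirect dependencies on removed commands to the fused command.
    for c in out:
        c.wait_for = [replaced.get(w, w) for w in c.wait_for]
    return out


# Two kernels on different devices exchanging data through a host buffer.
cmds = [
    Command("k1", "kernel", dst="gpu0:buf"),
    Command("read", "d2h", src="gpu0:buf", dst="host:tmp", wait_for=["k1"]),
    Command("write", "h2d", src="host:tmp", dst="gpu1:buf", wait_for=["read"]),
    Command("k2", "kernel", src="gpu1:buf", wait_for=["write"]),
]
optimized = remove_dhd_copies(cmds)
dag = build_dag(optimized)
print(dag)
```

In this sketch the two copies through `host:tmp` collapse into a single `read+write` command that depends on `k1`, and `k2` is redirected to wait on it. A real runtime would additionally have to consider device placement across nodes and whether the host buffer is observed by the application.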

  • Publication date: 2014-12