Title :
Exploiting Task-Parallelism on GPU Clusters via OmpSs and rCUDA Virtualization
Author :
Adrián Castelló;Rafael Mayo;Judit Planas; Quintana-Ortí
Author_Institution :
Dept. de Ing. y Cienc. de Comput., Univ. Jaume I, Castellon, Spain
Abstract :
OmpSs is a task-parallel programming model consisting of a reduced collection of OpenMP-like directives, a front-end compiler, and a runtime system. This directive-based programming interface helps developers accelerate their application´s execution, e.g. in a cluster equipped with graphics processing units (GPUs), with a low programming effort. On the other hand, the virtualization package rCUDA provides seamless and transparent remote access to any CUDA GPU in a cluster, via the CUDA Driver and Runtime programming interfaces. In this paper we investigate the hurdles and practical advantages of combining these two technologies. Our experimental study targets two cluster configurations: a system where all the GPUs are located into a single cluster node, and a cluster with the GPUs distributed among the nodes. Two applications, the N-body particle simulation and the Cholesky factorization of a dense matrix, are employed to expose the bottlenecks and performance of a remote virtualization solution applied to these two OmpSs task-parallel codes.
Keywords :
"Graphics processing units","Servers","Virtualization","Runtime","Message systems","Programming","Instruction sets"
Conference_Titel :
Trustcom/BigDataSE/ISPA, 2015 IEEE
DOI :
10.1109/Trustcom.2015.626