DocumentCode :
1996961
Title :
Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming
Author :
Aji, Ashwin M. ; Balaji, Pavan ; Dinan, James ; Wu-Chun Feng ; Thakur, Rahul
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1020
Lastpage :
1029
Abstract :
Despite the vast interest in accelerator-based systems, programming large multinode GPUs is still a complex task, particularly with respect to optimal data movement across the host-GPU PCIe connection and then across the network. In order to address such issues, GPU-integrated MPI solutions have been developed that integrate GPU data movement into existing MPI implementations. Currently available GPU-integrated frameworks differ in aspects related to the buffer synchronization and ordering semantics they provide to users. The noteworthy models are (1) unified virtual addressing (UVA)-based approach and (2) MPI attributes-based approach. In this paper, we compare these approaches, for both programmability and performance, and demonstrate that the UVA-based design is useful for isolated communication with no data dependencies or ordering requirements, while the attributes-based design might be more appropriate when multiple interdependent MPI and GPU operations are interleaved.
Keywords :
application program interfaces; graphics processing units; message passing; MPI attributes-based approach; UVA-based approach; UVA-based design; attributes-based design; hybrid MPI+GPU programming; isolated communication; unified virtual addressing; Data transfer; Graphics processing units; Kernel; Performance evaluation; Programming; Semantics; Synchronization; CUDA; GPGPU; MPI; MPI-ACC; OpenCL; Unified Virtual Addressing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.256
Filename :
6650987
Link To Document :
بازگشت