Title :
TupleQ: Fully-asynchronous and zero-copy MPI over InfiniBand
Author :
Koop, Matthew J. ; Sridhar, Jaidev K. ; Panda, Dhabaleswar K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
The message passing interface (MPI) is the defacto standard for parallel programming. As system scales increase, application writers often try to increase the overlap of communication and computation. Unfortunately, even on offloaded hardware such as InfiniBand, performance is not improved since the underlying protocols within MPI implementation require control messages that prevent overlap without expensive threads. In this work we propose a fully-asynchronous and zero-copy design to allow full overlap of communication and computation. We design TupleQ with novel use of InfiniBand eXtended Reliable Connection (XRC) receive queues to allow zero-copy and asynchronous transfers for all message sizes. Our evaluation on 64 tasks reveals significant performance gains. By leveraging the network hardware we are able to provide fully-asynchronous progress. We show overlap of nearly 100% for all message sizes, compared to 0% for the traditional RPUT and RGET protocols. We also show a 27% improvement for NAS SP using our design over the existing designs.
Keywords :
application program interfaces; message passing; parallel programming; workstation clusters; InfiniBand; TupleQ; asynchronous transfers; cluster computing; extended reliable connection; fully-asynchronous MPI; message passing interface; network hardware; offloaded hardware; parallel programming; zero-copy MPI; zero-copy transfers; Communication system control; Computer science; Hardware; Libraries; Message passing; Parallel programming; Performance gain; Protocols; Sun; Yarn;
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2009.5161056