DocumentCode :
1987580
Title :
MVAPICH2-MIC: A High Performance MPI Library for Xeon Phi Clusters with InfiniBand
Author :
Potluri, Sreeram ; Hamidouche, Khaled ; Bureddy, D. ; Panda, Dhabaleswar K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2013
fDate :
15-16 Aug. 2013
Firstpage :
25
Lastpage :
32
Abstract :
Intel´s Xeon Phi coprocessor, based on Many Integrated Core architecture, packs more than 1TFLOP of performance on a single chip and offers x86 compatibility. While MPI libraries can run out-of-the-box on the Xeon Phi coprocessors, it is critical to tune them for the new architecture and to redesign them using any new system level features offered in order to deliver performance. In this paper, we discuss the tuning and redesign of the MVAPICH2 MPI library for efficient intra-node and inter-node point-to-point communication on XeonPhi clusters with InfiniBand. We evaluate the designs using micro-benchmarks and application kernels. The results show significant improvements in performance of intra-MIC, intranode and internode communication. For the internode MIC-MIC path, the latency of 4M messages is reduced by 65% and the bandwidth for the same message size is improved by 5 times. The designs show 50% and 16% improvement in performance of 3DStencil communication kernel and P3DFFT library on 32 and 8 nodes, respectively. We discuss the challenges involved in providing a further optimized MVAPICH2 MPI library for Xeon Phi clusters.
Keywords :
application program interfaces; coprocessors; message passing; multiprocessing systems; software libraries; 3DStencil communication kernel; InfiniBand; Intel Xeon Phi coprocessor; MVAPICH2-MIC; P3DFFT library; Xeon Phi clusters; application kernels; high performance MPI library; inter-node point-to-point communication; internode MIC-MIC path; intra-node point-to-point communication; many integrated core architecture; microbenchmarks; system level features; x86 compatibility; Bandwidth; Bridges; Computer architecture; Coprocessors; Libraries; Microwave integrated circuits; Peer-to-peer computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Extreme Scaling Workshop (XSW), 2013
Conference_Location :
Boulder, CO
Type :
conf
DOI :
10.1109/XSW.2013.8
Filename :
6805039
Link To Document :
بازگشت