DocumentCode :
3303818
Title :
Performance Evaluation of Open MPI on Cray XE/XK Systems
Author :
Gutierrez, Samuel K. ; Hjelm, Nathan T. ; Venkata, Manjunath Gorentla ; Graham, Richard L.
Author_Institution :
High Performance Comput. Div., Los Alamos Nat. Lab., Los Alamos, NM, USA
fYear :
2012
fDate :
22-24 Aug. 2012
Firstpage :
40
Lastpage :
47
Abstract :
Open MPI is a widely used open-source implementation of the MPI-2 standard that supports a variety of platforms and interconnects. Current versions of Open MPI, however, lack support for the Cray XE6 and XK6 architectures -- both of which use the Gemini System Interconnect. In this paper, we present extensions to natively support these architectures within Open MPI, describe and propose solutions for performance and scalability bottlenecks, and provide an extensive evaluation of our implementation, which is the first completely open-source MPI implementation for the Cray XE/XK system families used at 49,152 processes. Application and micro-benchmark results show that the performance and scaling characteristics of our implementation are similar to the vendor-supplied MPI's. Micro-benchmark results show short-data 1-byte and 1,024-byte message latencies of 1.20 μs and 4.13 μs, which are 10.00% and 39.71% better than the vendor-supplied MPI's, respectively. Our implementation achieves a bandwidth of 5.32 GB/s at 8 MB, which is similar to the vendor-supplied MPI's bandwidth at the same message size. Two Sequoia benchmark applications, LAMMPS and AMG2006, were also chosen to evaluate our implementation at scales up to 49,152 cores -- where we exhibited similar performance and scaling characteristics when compared to the vendor-supplied MPI implementation. LAMMPS achieved a parallel efficiency of 88.20% at 49,152 cores using Open MPI, which is on par with the vendor-supplied MPI's achieved parallel efficiency.
Keywords :
Cray computers; message passing; parallel architectures; AMG2006; Cray XE/XK system; Cray XE6 architecture; Cray XK6 architecture; Gemini system interconnect; LAMMPS; MPI-2 standard; Sequoia benchmark application; efficiency 88.20 percent; message latency; message size; open-source MPI; open-source implementation; parallel efficiency; performance evaluation; scaling characteristics; time 1.20 μs; time 4.13 μs; vendor-supplied MPI; Bandwidth; Benchmark testing; Libraries; Open source software; Protocols; Receivers; Cray; Gemini; Open MPI; XPMEM; uGNI
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High-Performance Interconnects (HOTI), 2012 IEEE 20th Annual Symposium on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-1-4673-2836-4
Type :
conf
DOI :
10.1109/HOTI.2012.11
Filename :
6299071