Title :
Custom Assignment of MPI Ranks for Parallel Multi-dimensional FFTs: Evaluation of BG/P versus BG/L
Author :
Jagode, Heike ; Hein, Joachim
Author_Institution :
Oak Ridge Nat. Lab. (ORNL), Univ. of Tennessee - Knoxville, Knoxville, TN
Abstract :
For many scientific applications, the fast Fourier transformation (FFT) of multi-dimensional data is the kernel that limits scalability on a large number of processors. This paper investigates the extent of performance improvements for a parallel three-dimensional FFT (3D-FFT) implementation when using customized MPI task mappings. The MPI tasks are mapped in a customized fashion from the two-dimensional virtual processor grid of the algorithm to the physical hardware of a system with a mesh interconnect. We compare and analyze the outcomes on Blue Gene/P with those from previous investigations on Blue Gene/L. The performance analysis is based on bandwidth considerations. The results demonstrate that on Blue Gene/P, a carefully chosen MPI task mapping with regards to the network characteristics is more important compared to Blue Gene/L and yields significant improvement.
Keywords :
application program interfaces; fast Fourier transforms; mathematics computing; message passing; parallel machines; Blue Gene/L; Blue Gene/P; customized MPI task mapping; fast Fourier transformation; mesh interconnect; multidimensional data; parallel multidimensional FFT implementation; performance analysis; scientific application; two-dimensional virtual processor grid; Communication networks; Differential equations; Distributed processing; Flexible printed circuits; Hardware; Laboratories; Performance analysis; Performance gain; Polynomials; Scalability; Blue Gene; FFT; MPI task mapping; MPI task placement; mesh communication network;
Conference_Titel :
Parallel and Distributed Processing with Applications, 2008. ISPA '08. International Symposium on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3471-8
DOI :
10.1109/ISPA.2008.136