DocumentCode :
3298859
Title :
Efficient Algorithms for Block-Cyclic Array Redistribution between Processor Sets
Author :
Park, Neungsoo ; Prasanna, Viktor K. ; Raghavendra, Cauligi
Author_Institution :
University of Southern California
fYear :
1998
fDate :
07-13 Nov. 1998
Firstpage :
10
Lastpage :
10
Abstract :
Run-time array redistribution is necessary to enhance the performance of parallel programs on distributed memory supercomputers. In this paper, we present an efficient algorithm for array redistribution from cyclic(x) on P processors to cyclic(Kx) on Q processors. The algorithm reduces the overall time for communication by considering the data transfer, communication schedule, and index computation costs. The proposed algorithm is based on a generalized circulant matrix formalism. Our algorithm generates a schedule that minimizes the number of communication steps and eliminates node contention in each communication step. The network bandwidth is fully utilized by ensuring that equal-sized messages are transferred in each communication step. Furthermore, the procedure to compute the schedule and the index sets is extremely fast. It takes O(max(P, Q)) time. Therefore, our proposed algorithm is suitable for run-time array redistribution. To evaluate the performance of our scheme, we have implemented the algorithm using C and MPI. The experiments were conducted on the IBM SP2. The experimental results show that the proposed algorithm outperforms well- known algorithms with respect to the total redistribution time including the data transfer and schedule and index computation times.
Keywords :
Bandwidth; Bibliographies; Computational efficiency; Costs; Multidimensional signal processing; Processor scheduling; Runtime; Scheduling algorithm; Signal processing algorithms; Supercomputers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, 1998.SC98. IEEE/ACM Conference on
Print_ISBN :
0-8186-8707-X
Type :
conf
DOI :
10.1109/SC.1998.10035
Filename :
1437297
Link To Document :
بازگشت