Title :
Packing/unpacking information generation for efficient generalized kr→r and r→kr array redistribution
Author :
Hsu, Ching-Hsien ; Chung, Yeh-Ching ; Dow, Chyi-Ren
Author_Institution :
Dept. of Inf. Eng., Feng Chia Univ., Taichung, Taiwan
Abstract :
Array redistribution is usually required to enhance algorithm performance in many parallel programs on distributed memory multicomputers. Since it is performed at run-time, there is a performance tradeoff between the efficiency of the new data decomposition for a subsequent phase of an algorithm and the cost of redistributing data among processors. In this paper, we present efficient methods to generate the packing/unpacking information for BLOCK-CYCLIC(kr) to BLOCK-CYCLIC(r) and BLOCK-CYCLIC(r) to BLOCK-CYCLIC(kr) redistribution with arbitrary source/destination processor sets. The most significant improvement of this paper is that a processor does not need to construct the send/receive data sets for a redistribution. Based on the packing/unpacking information derived from kr→r and r→kr redistributions, a processor can pack array elements into messages, and unpack them from messages, directly. To evaluate the performance of our methods, we implemented them along with the PITFALLS method and Prylli's method on an IBM SP2 parallel machine. The experimental results show that our algorithms outperform the PITFALLS method and Prylli's method for all test samples.
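For readers unfamiliar with the notation, the following is a minimal sketch of what a BLOCK-CYCLIC(kr) to BLOCK-CYCLIC(r) redistribution has to compute. It is not the method of the paper: it is the naive baseline that explicitly builds the send data sets by scanning every element, which is exactly the step the abstract says the proposed methods avoid. The function names (`owner`, `local_index`, `naive_send_sets`) and the 0-based indexing are assumptions made for illustration only.

```python
def owner(i, block, nprocs):
    """Processor that owns global index i (0-based) under BLOCK-CYCLIC(block)."""
    return (i // block) % nprocs


def local_index(i, block, nprocs):
    """Position of global index i inside its owner's local array."""
    return (i // (block * nprocs)) * block + i % block


def naive_send_sets(n, src_block, src_procs, dst_block, dst_procs):
    """Brute-force send sets: for each (source, destination) processor pair,
    list the source-local indices that must be packed into that message.
    Illustrative baseline only; it costs O(n) work and builds every set."""
    sets = {}
    for i in range(n):
        s = owner(i, src_block, src_procs)
        d = owner(i, dst_block, dst_procs)
        sets.setdefault((s, d), []).append(local_index(i, src_block, src_procs))
    return sets


if __name__ == "__main__":
    # Hypothetical example: r = 1, k = 2, 12 elements,
    # 2 source processors, 3 destination processors.
    for pair, indices in sorted(naive_send_sets(12, 2, 2, 1, 3).items()):
        print(pair, indices)
```

In this sketch the source and destination processor counts are independent parameters, which mirrors the "arbitrary source/destination processor sets" setting described above; an r→kr redistribution is obtained by swapping the two block sizes.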
Keywords :
distributed memory systems; parallel programming; algorithm performance; array redistribution; data decomposition; distributed memory multicomputers; packing; parallel programs; performance tradeoff; unpacking; Asynchronous communication; Costs; Equations; Fast Fourier transforms; Parallel programming; Phased arrays; Read only memory; Runtime; Terminology; Testing;
Conference_Title :
Frontiers of Massively Parallel Computation, 1999. Frontiers '99. The Seventh Symposium on the
Conference_Location :
Annapolis, MD
Print_ISBN :
0-7695-0087-0
DOI :
10.1109/FMPC.1999.750588