Title :
Efficient parallel out-of-core matrix transposition
Author :
Krisnamoorthy ; Baumgartner, Gerald ; Cociorva, Daniel ; Lam, Chi-Chung ; Sadyappan
Author_Institution :
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
Abstract :
This paper addresses the problem of parallel transposition of large out-of-core arrays. Although algorithms for out-of-core matrix transposition have been widely studied, previously proposed algorithms have sought to minimize the number of I/O operations and the in-memory permutation time. We propose an algorithm that directly targets the improvement of overall transposition time. The I/O characteristics of the system are used to determine the read, write and communication block sizes such that the total execution time is minimized. We also provide a solution to the array redistribution problem for arrays on disk. The solution to the sequential transposition problem and the parallel array redistribution problem are then combined to obtain an algorithm for the parallel out-of-core transposition problem.
Keywords :
input-output programs; matrix algebra; parallel algorithms; parallel programming; I/O characteristics; I/O operations; array redistribution problem; communication block size; disk arrays; execution time; in-memory permutation time; out-of-core arrays; parallel array redistribution; parallel out-of-core matrix transposition; read block size; sequential transposition problem; transposition time; write block size; Chemistry; Clustering algorithms; Computer input-output; Concurrent computing; Engines; Fast Fourier transforms; Flexible printed circuits; Information science; Information technology; Matrices; Multidimensional systems; Parallel algorithms; Parallel programming; Tensile stress;
Conference_Titel :
Cluster Computing, 2003. Proceedings. 2003 IEEE International Conference on
Print_ISBN :
0-7695-2066-9
DOI :
10.1109/CLUSTR.2003.1253328