Title :
A flexible class of parallel matrix multiplication algorithms
Author :
Gunnels, John ; Lin, Calvin ; Morrow, Greg ; van de Geijn, Robert
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
fDate :
30 Mar-3 Apr 1998
Abstract :
This paper explains why parallel implementation of matrix multiplication, a seemingly simple algorithm that can be expressed as one statement and three nested loops, is complex. Practical algorithms that use matrix multiplication tend to use matrices of disparate shapes, and the shape of the matrices can significantly impact the performance of matrix multiplication. We provide a class of algorithms that covers the spectrum of shapes encountered and demonstrate that good performance can be attained if the right algorithm is chosen. While the paper resolves a number of issues, it concludes with discussion of a number of directions yet to be pursued
Keywords :
mathematics computing; matrix multiplication; parallel algorithms; program control structures; software performance evaluation; matrix shape; nested loops; parallel implementation; parallel matrix multiplication algorithms; performance; Algorithm design and analysis; Broadcasting; Councils; Geoscience; High performance computing; Ice; Mathematics; Memory architecture; NASA; Shape;
Conference_Titel :
Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-8404-6
DOI :
10.1109/IPPS.1998.669898