DocumentCode
2716808
Title
A flexible class of parallel matrix multiplication algorithms
Author
Gunnels, John ; Lin, Calvin ; Morrow, Greg ; van de Geijn, Robert
Author_Institution
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
fYear
1998
fDate
30 Mar-3 Apr 1998
Firstpage
110
Lastpage
116
Abstract
This paper explains why parallel implementation of matrix multiplication, a seemingly simple algorithm that can be expressed as one statement and three nested loops, is complex. Practical algorithms that use matrix multiplication tend to use matrices of disparate shapes, and the shape of the matrices can significantly impact the performance of matrix multiplication. We provide a class of algorithms that covers the spectrum of shapes encountered and demonstrate that good performance can be attained if the right algorithm is chosen. While the paper resolves a number of issues, it concludes with a discussion of a number of directions yet to be pursued.
Keywords
mathematics computing; matrix multiplication; parallel algorithms; program control structures; software performance evaluation; matrix shape; nested loops; parallel implementation; parallel matrix multiplication algorithms; performance; Algorithm design and analysis; Broadcasting; Councils; Geoscience; High performance computing; Ice; Mathematics; Memory architecture; NASA; Shape
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
Conference_Location
Orlando, FL
ISSN
1063-7133
Print_ISBN
0-8186-8404-6
Type
conf
DOI
10.1109/IPPS.1998.669898
Filename
669898