DocumentCode :
3042891
Title :
SRUMMA: a matrix multiplication algorithm suitable for clusters and scalable shared memory systems
Author :
Krishnan, Manojkumar ; Nieplocha, Jarek
Author_Institution :
Computational Sci. & Math., Pacific Northwest Nat. Lab., Washington, DC, USA
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
70
Abstract :
Summary form only given. We describe a novel parallel algorithm that implements a dense matrix multiplication operation with algorithmic efficiency equivalent to that of Cannon´s algorithm. It is suitable for clusters and scalable shared memory systems. The current approach differs from the other parallel matrix multiplication algorithms by the explicit use of shared memory and remote memory access (RMA) communication rather than message passing. The experimental results on clusters (IBM SP, Linux-Myrinet) and shared memory systems (SGI Altix, Cray XI) demonstrate consistent performance advantages over pdgemm from the ScaLAPACK/PBBLAS suite, the leading implementation of the parallel matrix multiplication algorithms used today. In the best case on the SGI Altix, the new algorithm performs 20 times better than pdgemm for a matrix size of 1000 on 128 processors. The impact of zero-copy nonblocking RMA communications and shared memory communication on matrix multiplication performance on clusters are investigated.
Keywords :
matrix multiplication; parallel algorithms; shared memory systems; workstation clusters; Cannon algorithm; SGI Altix; clusters shared memory systems; matrix multiplication algorithm; parallel algorithm; remote memory access communication; scalable shared memory systems; Algorithm design and analysis; Clustering algorithms; Concurrent computing; Distributed computing; Distributed processing; Hardware; Laboratories; Mathematics; Partitioning algorithms; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303000
Filename :
1303000
Link To Document :
بازگشت