DocumentCode
2719606
Title
Mixed mode matrix multiplication
Author
Meng-Shiou Wu ; Aluru, Srinivas
Author_Institution
Dept. of Electr. & Comput. Eng., USDOE, Ames, IA
fYear
2002
fDate
2002
Firstpage
195
Lastpage
203
Abstract
In modern clustering environments where the memory hierarchy has many layers (distributed memory, shared memory layer, cache, ...), an important question is how to fully utilize all available resources and identify the most dominant layer in certain computation. When combining algorithms on all layers together, what would be the best method to get the best performance out of all the resources we have? The mixed mode programming model that uses thread programming on the shared memory layer and message passing programming on the distributed memory layer is a method that many researchers are using to utilize the memory resources. We take an algorithmic approach that uses matrix multiplication as a tool to show how cache algorithms affect the performance of both shared memory and distributed memory algorithms. We show that with good underlying cache algorithm, overall performance is stable. When the underlying cache algorithm is bad, superlinear speedup may occur and increasing number of threads may also improve performance.
Keywords
application program interfaces; cache storage; distributed shared memory systems; matrix multiplication; message passing; multi-threading; performance evaluation; workstation clusters; cache algorithms; computer clustering; distributed memory layer; memory hierarchy; message passing programming; mixed mode matrix multiplication; mixed mode programming model; performance; shared memory layer; superlinear speedup; thread programming; Clustering algorithms; Computational modeling; Computer science; Data communication; Distributed computing; High-speed networks; Laboratories; Message passing; US Department of Energy; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-2066-9
Type
conf
DOI
10.1109/CLUSTR.2002.1137747
Filename
1137747
Link To Document