DocumentCode
3731301
Title
A generic and adaptive approach for workload distribution in multi-tier cluster systems with an application to distributed matrix multiplication
Author
Darius Malysiak;Thomas Kopinski
Author_Institution
Hochschule Ruhr West Computer Science Institute, Bottrop, Germany
fYear
2015
Firstpage
255
Lastpage
266
Abstract
We present a novel approach of distributing matrix multiplications among GPU-equipped nodes in a cluster system. In this context we discuss the induced challenges and possible solutions. Additionally we state an algorithm which outperforms optimized GPU BLAS libraries for small matrices. Furthermore we provide a novel theoretical model for distributing algorithms within homogeneous computation systems with multiple hierarchies. In the context of this model we develop an algorithm which can find the optimal distribution parameters for each involved subalgorithm. We provide a detailed analysis of the algorithms space and time complexities and justify its use with a structured evaluation within a small GPU-equipped Beowulf cluster.
Keywords
"Clustering algorithms","Complexity theory","Graphics processing units","Algorithm design and analysis","Computational modeling","Processor scheduling","Instruction sets"
Publisher
ieee
Conference_Titel
Computational Intelligence and Informatics (CINTI), 2015 16th IEEE International Symposium on
Type
conf
DOI
10.1109/CINTI.2015.7382933
Filename
7382933
Link To Document