DocumentCode :
2545582
Title :
Performance Model for Parallel Matrix Multiplication with Dryad: Dataflow Graph Runtime
Author :
Hui Li ; Fox, G. ; Qiu, Jian
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ. Bloomington, Bloomington, IN, USA
fYear :
2012
fDate :
1-3 Nov. 2012
Firstpage :
675
Lastpage :
683
Abstract :
In order to meet the big data challenge of today´s society, several parallel execution models on distributed memory architectures have been proposed: MapReduce, Iterative MapReduce, graph processing, and dataflow graph processing. Dryad is a distributed data-parallel execution engine that model program as dataflow graphs. In this paper, we evaluated the runtime and communication overhead of Dryad in realistic settings. We proposed a performance model for Dryad implementation of parallel matrix multiplication (PMM) and extend the model to MPI implementations. We conducted experimental analyses in order to verify the correctness of our analytic model on a Windows cluster with up to 400 cores, Azure with up to 100 instances, and Linux cluster with up to 100 nodes. The final results show that our analytic model produces accurate predictions within 5% of the measured results. We proved some cases that using average communication overhead to model performance of parallel matrix multiplication jobs on common HPC clusters is the practical approach.
Keywords :
Linux; data flow graphs; distributed memory systems; iterative methods; matrix multiplication; memory architecture; parallel processing; Azure; Dryad; HPC cluster; Linux cluster; MPI implementation; PMM; Windows cluster; communication overhead; dataflow graph processing; dataflow graph runtime; distributed data-parallel execution engine; distributed memory architecture; iterative MapReduce; model program; parallel execution model; parallel matrix multiplication; runtime overhead; Analytical models; Broadcasting; Computational modeling; Data models; Matrix decomposition; Runtime environment; Dataflow Graph; Dryad; MPI; Matrix Multiplication; Performance Modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud and Green Computing (CGC), 2012 Second International Conference on
Conference_Location :
Xiangtan
Print_ISBN :
978-1-4673-3027-5
Type :
conf
DOI :
10.1109/CGC.2012.23
Filename :
6382889
Link To Document :
بازگشت