DocumentCode :
2263002
Title :
Performance Prediction for MPI Parallel Jobs
Author :
Zhang, Weizhe ; Han, Tianyu ; Zhang, Yuanjing ; Cheng, Albert M K
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
fYear :
2012
fDate :
24-28 Sept. 2012
Firstpage :
136
Lastpage :
142
Abstract :
Performance prediction of run time in the cluster is the foundation of efficient resource management and task scheduling. Considering the defects and limitations of traditional methods based on the history and modeling analysis, this paper proposes a new approach based on the Performance Skeleton. Through the use of the MPI library´s PMPI interface, we can insert wrapper-functions to the source code, which can access all communication traces without changing the original program or affecting the operation of the original program. To merge these trace logs, we designed the trace log regularization and merging algorithm. For compressing circulatory traces, the most central and difficult problem, this paper converts it into a circular sub-string compression problem, and proposes an algorithm based on the suffix array. Its performance is better than the existing algorithms. To automatically reconstruct the Performance Skeleton, it solves the scalable problems of calculation and communication time. Experimental results show that these methods can accurately estimate the run time of computing jobs. The error is less than 3% for a homogeneous cluster.
Keywords :
application program interfaces; merging; message passing; parallel processing; pattern clustering; resource allocation; scheduling; MPI library PMPI interface; MPI parallel jobs; circular substring compression problem; communication traces; history analysis; homogeneous cluster; merging algorithm; modeling analysis; performance prediction; performance skeleton; resource management; source code; suffix array; task scheduling; trace log regularization; wrapper-functions; Algorithm design and analysis; Analytical models; Arrays; Computational modeling; Delay; Skeleton; circular sub-string compressing; parallel job; performance prediction; performance skeleton;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2893-7
Type :
conf
DOI :
10.1109/ClusterW.2012.13
Filename :
6355857
Link To Document :
بازگشت