DocumentCode :
56339
Title :
Parallel Workload Modeling with Realistic Characteristics
Author :
Tran Ngoc Minh ; Thoai Nam ; Epema, Dick H. J.
Author_Institution :
Leiden Inst. of Adv. Comput. Sci., Leiden Univ., Leiden, Netherlands
Volume :
25
Issue :
8
fYear :
2014
fDate :
Aug. 2014
Firstpage :
2138
Lastpage :
2148
Abstract :
Workload modeling and performance evaluation play crucial roles in the study of scheduling algorithms on large-scale parallel and distributed systems. An effective design of a scheduling algorithm for these systems requires experiments with hundreds of simulations to evaluate its performance. Since each simulation needs one workload as input, real workloads alone, which usually have limited availability, are not sufficient, and so representative workload models are needed. Several studies have shown that realistic workload characteristics such as burstiness and bag-of-tasks behavior have a significant impact on scheduling performance. Therefore, we argue that realistic workload models should contain as many characteristics of real workloads as possible. In practice, researchers use unrealistic workloads in their scheduling evaluations because they lack models that can help generate realistic workloads. In this article, we analyze real parallel workloads to show the presence of important characteristics including long range dependence, periodicity and temporal burstiness of job arrivals, bag-of-tasks behavior, and correlation of runtime and number of processors. Then, we present a systematic approach to create a complete model that contains all of these characteristics. Validation of our model with real-world data shows that it not only captures the above characteristics but also fits marginal distributions well.
Keywords :
parallel processing; scheduling; software performance evaluation; bag-of-tasks; distributed systems; large-scale parallel systems; long range dependence; marginal distributions; parallel workload modeling; performance evaluation; realistic characteristics; representative workload models; scheduling algorithms; scheduling evaluations; temporal job arrival burstiness; Correlation; Data models; Load modeling; Local area networks; Materials; Parallel processing; Runtime; Parallel workload modeling; bag-of-tasks; correlation; long range dependence; periodicity; temporal burstiness;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2013.182
Filename :
6567858