DocumentCode :
3090072
Title :
Towards Synthesizing Realistic Workload Traces for Studying the Hadoop Ecosystem
Author :
Wang, Guanying ; Butt, Ali R. ; Monti, Henry ; Gupta, Karan
fYear :
2011
fDate :
25-27 July 2011
Firstpage :
400
Lastpage :
408
Abstract :
Designing cloud computing setups is a challenging task. It involves understanding the impact of a plethora of parameters ranging from cluster configuration, partitioning, networking characteristics, and the targeted applications´ behavior. The design space, and the scale of the clusters, make it cumbersome and error-prone to test different cluster configurations using real setups. Thus, the community is increasingly relying on simulations and models of cloud setups to infer system behavior and the impact of design choices. The accuracy of the results from such approaches depends on the accuracy and realistic nature of the workload traces employed. Unfortunately, few cloud workload traces are available (in the public domain). In this paper, we present the key steps towards analyzing the traces that have been made public, e.g., from Google, and inferring lessons that can be used to design realistic cloud workloads as well as enable thorough quantitative studies of Hadoop design. Moreover, we leverage the lessons learned from the traces to undertake two case studies: (i) Evaluating Hadoop job schedulers, and (ii) Quantifying the impact of shared storage on Hadoop system performance.
Keywords :
cloud computing; software performance evaluation; Google; Hadoop design; Hadoop ecosystem; Hadoop job schedulers; Hadoop system performance; cloud computing; cloud workloads; cluster configuration; networking characteristics; realistic workload traces; shared storage; Analytical models; Biological system modeling; Color; Computational modeling; Google; Memory management; Visualization; Cloud computing; Design optimization; Performance analysis; Software performance modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE 19th International Symposium on
Conference_Location :
Singapore
ISSN :
1526-7539
Print_ISBN :
978-1-4577-0468-0
Type :
conf
DOI :
10.1109/MASCOTS.2011.59
Filename :
6005384
Link To Document :
بازگشت