DocumentCode :
576927
Title :
A Practical Performance Model for Hadoop MapReduce
Author :
Lin, Xuelian ; Meng, Zide ; Xu, Chuan ; Wang, Meng
Author_Institution :
Sch. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
fYear :
2012
fDate :
24-28 Sept. 2012
Firstpage :
231
Lastpage :
239
Abstract :
An accurate performance model for MapReduce is increasingly important for analyzing and optimizing MapReduce jobs. It is also a precondition for implementing cost-based scheduling strategies and for translating Hive-like query jobs into sets of low-cost MapReduce jobs. However, the multiple processing steps in a MapReduce task, the complexity of the relationships among these steps, and the difficulty of measuring the computational complexity of a MapReduce task greatly challenge the development and application of a precise performance model. In this paper, we define the concept of the relative computational complexity of a MapReduce task to estimate the complexity of the task, and illustrate how to measure it. Then, we analyze the detailed composition of MapReduce tasks and the relationships among them, decompose the major cost items, and present a vector-style cost model with equations to calculate each cost item. Moreover, we provide equations to estimate the task execution time based on the cost vectors. Experiments on several Hadoop clusters confirm the effectiveness of the proposed performance model.
Keywords :
computational complexity; distributed processing; query processing; scheduling; Hadoop MapReduce; computational complexity; cost-based scheduling strategies; hive like query jobs; practical performance model; vector style cost model; Computational complexity; Computational modeling; Hard disks; Mathematical model; Sorting; Vectors; Writing; Hadoop; MapReduce; performance model;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2893-7
Type :
conf
DOI :
10.1109/ClusterW.2012.24
Filename :
6355869