DocumentCode
2703400
Title
Analysis of resource usage profile for MapReduce applications using Hadoop on cloud
Author
Liu, Zheyuan ; Mu, Dejun
Author_Institution
Control & Network Inst., Northwestern Polytech. Univ., Xi´´an, China
fYear
2012
fDate
15-18 June 2012
Firstpage
1500
Lastpage
1504
Abstract
In this paper we present a study of resource consumption profiles for MapReduce applications using Hadoop on Amazon EC2. We selected three applications and measured their resource usage in terms of CPU and memory footprint. Specifically, we study Grep, Word Count and Sort applications while altering Hadoop´s configuration parameters corresponding to I/O buffer. Our study brings up 3 key points. Firstly, effect of I/O parameters on total running time of the application; secondly, invalid assumptions of Hadoop scheduler that three phases (copy, sort and reduce) of a Reduce task are equal; finally, an insight supported by the results from the experiments on ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications. To the best of our knowledge this is the first work that presents resource usage study.
Keywords
cloud computing; resource allocation; Amazon EC2; CPU; Grep applications; Hadoop scheduler; I/O buffer; I/O parameters; MapReduce applications; Sort applications; Word Count applications; cloud computing; configuration parameters; memory footprint; reduce task; resource consumption information; resource consumption profiles; resource usage profile analysis; Computational modeling; Databases; File systems; Resource management; Schedules; Web services; Writing; hadoop scheduler; map reduce; resource usage;
fLanguage
English
Publisher
ieee
Conference_Titel
Quality, Reliability, Risk, Maintenance, and Safety Engineering (ICQR2MSE), 2012 International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4673-0786-4
Type
conf
DOI
10.1109/ICQR2MSE.2012.6246510
Filename
6246510
Link To Document