Title :
Analysis of resource usage profile for MapReduce applications using Hadoop on cloud
Author :
Liu, Zheyuan ; Mu, Dejun
Author_Institution :
Control & Network Inst., Northwestern Polytech. Univ., Xi´´an, China
Abstract :
In this paper we present a study of resource consumption profiles for MapReduce applications using Hadoop on Amazon EC2. We selected three applications and measured their resource usage in terms of CPU and memory footprint. Specifically, we study Grep, Word Count and Sort applications while altering Hadoop´s configuration parameters corresponding to I/O buffer. Our study brings up 3 key points. Firstly, effect of I/O parameters on total running time of the application; secondly, invalid assumptions of Hadoop scheduler that three phases (copy, sort and reduce) of a Reduce task are equal; finally, an insight supported by the results from the experiments on ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications. To the best of our knowledge this is the first work that presents resource usage study.
Keywords :
cloud computing; resource allocation; Amazon EC2; CPU; Grep applications; Hadoop scheduler; I/O buffer; I/O parameters; MapReduce applications; Sort applications; Word Count applications; cloud computing; configuration parameters; memory footprint; reduce task; resource consumption information; resource consumption profiles; resource usage profile analysis; Computational modeling; Databases; File systems; Resource management; Schedules; Web services; Writing; hadoop scheduler; map reduce; resource usage;
Conference_Titel :
Quality, Reliability, Risk, Maintenance, and Safety Engineering (ICQR2MSE), 2012 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4673-0786-4
DOI :
10.1109/ICQR2MSE.2012.6246510