DocumentCode :
3732359
Title :
JellyFish: Online Performance Tuning with Adaptive Configuration and Elastic Container in Hadoop Yarn
Author :
Xiaoan Ding;Yi Liu;Depei Qian
Author_Institution :
Sino-German Joint Software Inst., Beihang Univ., Beijing, China
fYear :
2015
Firstpage :
831
Lastpage :
836
Abstract :
MapReduce is a popular computing framework for large-scale data processing. Practical experience shows that inappropriate configurations can result in poor performance of MapReduce jobs, however, it is challenging to pick out a suitable configuration in a short time. Also, current central resource scheduler may cause low resource utilization, and degrade the performance of the cluster. This paper proposes an online performance tuning system, JellyFish, to improve performance of MapReduce jobs and increase resource utilization in Hadoop YARN. JellyFish continually collects real-time statistics to optimize configuration and resource allocation dynamically during execution of a job. During performance tuning process, JellyFish firstly tunes configuration parameters by reducing the dimensionality of search space with a divide-and-conquer approach and using a model-based hill climbing algorithm to improve tuning efficiency; secondly, JellyFish re-schedules resources in nodes by using a novel elastic container that can expand and shrink dynamically according to resource usage, and a resource re-scheduling strategy to make full use of cluster resources. Experimental results show that JellyFish can improve performance of MapReduce jobs by an average of 24% for jobs run for the first time, and by an average of 65% for jobs run multiple times compared to default YARN.
Keywords :
"Containers","Tuning","Yarn","Resource management","Monitoring","Real-time systems","Heuristic algorithms"
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2015 IEEE 21st International Conference on
Electronic_ISBN :
1521-9097
Type :
conf
DOI :
10.1109/ICPADS.2015.112
Filename :
7384375
Link To Document :
بازگشت