DocumentCode
256512
Title
Self adaptive Hadoop scheduler for heterogeneous resources
Author
Elkholy, A.M. ; Sallam, E.A.H.
Author_Institution
Comput. & Autom. Control Dept., Tanta Univ., Tanta, Egypt
fYear
2014
fDate
22-23 Dec. 2014
Firstpage
427
Lastpage
432
Abstract
Nowadays, Hadoop is a widely used framework for processing large data. Hadoop scheduler is a critical element which has a big effect on Hadoop performance. Finding a dynamic scheduler which adapts to different nodes computing capabilities and the same node performance is a challenging problem. Most of the current Hadoop schedulers consider the homogeneity of the resources on which Hadoop is running and assign each node in the cluster a fixed capacity over the run time, neglecting the different nodes computing capabilities and the performance of each node over the run time. This causes under/over utilization of resources, poor performance and longer run time. So, we propose a dynamic Hadoop scheduler which adapts to the performance and the computing capabilities of each node separately. The proposed scheduler controls the capacity of each node which represented by the number of tasks that can be processed concurrently at a time. The scheduler extends/shrinks the capacity of each node depending on its available resources and performance over the run time. Our scheduler is implemented on Hadoop and compared by the Hadoop Fair Scheduler. The experimental results show that our scheduler has achieved less average completion time and higher resources utilization.
Keywords
data handling; parallel processing; Hadoop fair scheduler; dynamic Hadoop scheduler; heterogeneous resources; self adaptive Hadoop scheduler; Heart beat; Hadoop; MapReduce; Scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Engineering & Systems (ICCES), 2014 9th International Conference on
Conference_Location
Cairo
Print_ISBN
978-1-4799-6593-9
Type
conf
DOI
10.1109/ICCES.2014.7030999
Filename
7030999
Link To Document