Title :
A Load-Driven Task Scheduler with Adaptive DSC for MapReduce
Author :
Mao, Hong ; Hu, Shengqiu ; Zhang, Zhenzhong ; Xiao, Limin ; Ruan, Li
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
Abstract :
With the rapid development of internet applications, more and more network service and commercial applications are deployed to cloud computing environment, with petabytes of data to be processed. MapReduce is one of the most famous solutions for large-scale data processing. This paper focuses on optimizing the scheduler of MapReduce framework in task level. We care about the hardware configuration and real-time workload of the nodes in a hadoop cluster and aim at shortening time cost of MapReduce jobs and improving hardware resource utilization rate. We put forward a load-driven task scheduler which assigns tasks to Task Trackers according to the workload of slave nodes. It is based on a Dynamic Slot Controller (DSC) that can adjust Map task Slot (MS) and Reduce task Slot (RS) of Task Trackers running on slave nodes adaptively. Our load-driven task scheduler can shorten time consumption of MapReduce job by 14% and improve the CPU utilization rate of hadoop cluster by 34% when processing 10GB data.
Keywords :
Internet; cloud computing; distributed processing; scheduling; CPU utilization rate; MapReduce; adaptive DSC; cloud computing environment; dynamic slot controller; hadoop cluster; hardware resource utilization rate; large-scale data processing; load-driven task scheduler; network service; reduce task slot; slave nodes; task trackers; Blades; Hardware; Job shop scheduling; Monitoring; Resource management; Servers; Hadoop; MapReduce; Task Schedule; Task slot; Workload;
Conference_Titel :
Green Computing and Communications (GreenCom), 2011 IEEE/ACM International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4577-1006-3
Electronic_ISBN :
978-0-7695-4466-3
DOI :
10.1109/GreenCom.2011.13