DocumentCode :
1683629
Title :
A Dynamic MapReduce Scheduler for Heterogeneous Workloads
Author :
Tian, Chao ; Zhou, Haojie ; He, Yongqiang ; Zha, Li
Author_Institution :
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear :
2009
Firstpage :
218
Lastpage :
224
Abstract :
MapReduce is an important programming model for building data centers containing ten of thousands of nodes. In a practical data center of that scale, it is a common case that I/O-bound jobs and CPU-bound jobs, which demand different resources, run simultaneously in the same cluster. In the MapReduce framework, parallelization of these two kinds of job has not been concerned. In this paper, we give a new view of the MapReduce model, and classify the MapReduce workloads into three categories based on their CPU and I/O utilization. With workload classification, we design a new dynamic MapReduce workload predict mechanism, MR-Predict, which detects the workload type on the fly. We propose a Triple-Queue Scheduler based on the MR-Predict mechanism. The Triple-Queue scheduler could improve the usage of both CPU and disk I/O resources under heterogeneous workloads. And it could improve the Hadoop throughput by about 30% under heterogeneous workloads.
Keywords :
data handling; resource allocation; scheduling; CPU utilization; CPU-bound job; Hadoop throughput; I/O utilization; I/O-bound job; MR-Predict mechanism; Triple-Queue Scheduler; data center; dynamic MapReduce scheduler; heterogeneous workload; resource usage; workload classification; Chaos; Computers; Dynamic programming; Dynamic scheduling; Grid computing; Hardware; Helium; Processor scheduling; Throughput; Web and internet services; MapReduce; Schdule; heterogeneous workloads;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Grid and Cooperative Computing, 2009. GCC '09. Eighth International Conference on
Conference_Location :
Lanzhou, Gansu
Print_ISBN :
978-0-7695-3766-5
Type :
conf
DOI :
10.1109/GCC.2009.19
Filename :
5279616
Link To Document :
بازگشت