DocumentCode
2995139
Title
Job Scheduling Optimization for Multi-user MapReduce Clusters
Author
Tao, Yongcai ; Zhang, Qing ; Shi, Lei ; Chen, Pinhua
Author_Institution
Sch. of Inf. Eng., Zhengzhou Univ., Zhengzhou, China
fYear
2011
fDate
9-11 Dec. 2011
Firstpage
213
Lastpage
217
Abstract
A shared MapReduce cluster is beneficial to build data warehouse which can be used by multiple users. FAIR scheduler gives each user the illusion of owning a private cluster. Moreover, it can dynamic redistribute capacity unused by some users to other users. However, when reassigning the slots, FAIR picks the most recently launched tasks to kill without considering the job character and data locality, which increases the network traffic while rescheduling the killed Map/Reduce tasks. The paper, based on FAIR scheduling, proposes an improved FAIR scheduling algorithm, which take into account the job character and data locality while killing tasks to make slots for new users. Performance evaluation results demonstrate that the improved FAIR decreases the data movement, speeds the execution of jobs, consequently improving the system performance.
Keywords
data warehouses; optimisation; pattern clustering; performance evaluation; scheduling; FAIR scheduler; FAIR scheduling algorithm; data locality; data warehouse; job character; job scheduling optimization; killed MapReduce tasks; multiuser MapReduce clusters; network traffic; performance evaluation; private cluster; shared MapReduce cluster; Benchmark testing; Educational institutions; File systems; Scheduling; Scheduling algorithm; Tin; HDFS; Hadoop; MapReduce; job Scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architectures, Algorithms and Programming (PAAP), 2011 Fourth International Symposium on
Conference_Location
Tianjin
Print_ISBN
978-1-4577-1808-3
Type
conf
DOI
10.1109/PAAP.2011.33
Filename
6128505
Link To Document