DocumentCode :
1791546
Title :
Scheduling MapReduce tasks on virtual MapReduce clusters from a tenant´s perspective
Author :
Jia-Chun Lin ; Ming-Chang Lee ; Yahyapour, Ramin
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
141
Lastpage :
146
Abstract :
Renting a set of virtual private servers (VPSs for short) from a VPS provider to establish a virtual MapReduce cluster is cost-efficient for a company/organization. To shorten job turnaround time and keep data locality as high as possible in this type of environment, this paper proposes a Best-Fit Task Scheduling scheme (BFTS for short) from a tenant´s perspective. BFTS schedules each map task to a VPS that can finish the task earlier than the other VPSs by predicting and comparing the time required by every VPS to retrieve the map-input data, execute the map task, and become idle in an online manner. Furthermore, BFTS schedules each reduce task to a VPS that is close to most VPSs that execute the related map tasks. We conduct extensive experiments to compare BFTS with several scheduling algorithms employed by Hadoop. The experimental results show that BFTS is better than the other tested algorithms in terms of map-data locality, reduce-data locality, and job turnaround time. The overhead incurred by BFTS is also evaluated, which is inevitable but acceptable compared with the other algorithms.
Keywords :
parallel programming; pattern clustering; scheduling; BFTS scheme; Hadoop; MapReduce task scheduling; VPS; best-fit task scheduling scheme; data locality; job turnaround time; scheduling algorithm; tenant perspective; virtual MapReduce clusters; virtual private servers; Benchmark testing; Clustering algorithms; Optical wavelength conversion; Schedules; Scheduling; Scheduling algorithms; Servers; MapReduce; data locality; map-task scheduling; reduce-task scheduling; virtual MapReduce cluster;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004223
Filename :
7004223
Link To Document :
بازگشت