DocumentCode
170789
Title
Minimizing makespan and total completion time in MapReduce-like systems
Author
Yuqing Zhu ; Yiwei Jiang ; Weili Wu ; Ling Ding ; Teredesai, Ankur ; Deying Li ; Wonjun Lee
Author_Institution
Dept. of Comput. Sci., Univ. of Texas at Dallas, Dallas, TX, USA
fYear
2014
fDate
April 27 2014-May 2 2014
Firstpage
2166
Lastpage
2174
Abstract
Effectiveness of MapReduce as a big data processing framework depends on efficiencies of scale for both map and reduce phases. While most map tasks are preemptive and parallelizable, the reduce tasks typically are not easily decomposed and often become a bottleneck due to constraints of data locality and task complexity. By assuming that reduce tasks are non-parallelizable, we study offline scheduling of minimizing makespan and minimizing total completion time, respectively. Both preemptive and non-preemptive reduce tasks are considered. On makespan minimization, for preemptive version we design an algorithm and prove its optimality, for non-preemptive version we design an approximation algorithm with the worst ratio of 3/2-1/2h where h is the number of machines. On total complete time minimization, for non-preemptive version we devise an approximation algorithm with worst case ratio of 2-1/h, and for preemptive version we devise a heuristic. We confirm that our algorithms outperform state-of-art schedulers through experiments.
Keywords
Big Data; approximation theory; data mining; MapReduce-like system; approximation algorithm; big data processing; makespan minimization; offline scheduling; total completion time; Algorithm design and analysis; Approximation algorithms; Approximation methods; Educational institutions; Minimization; Optimal scheduling; Schedules;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2014 Proceedings IEEE
Conference_Location
Toronto, ON
Type
conf
DOI
10.1109/INFOCOM.2014.6848159
Filename
6848159
Link To Document