Title :
Minimizing makespan and total completion time in MapReduce-like systems
Author :
Yuqing Zhu ; Yiwei Jiang ; Weili Wu ; Ling Ding ; Teredesai, Ankur ; Deying Li ; Wonjun Lee
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Dallas, Dallas, TX, USA
fDate :
April 27 2014-May 2 2014
Abstract :
Effectiveness of MapReduce as a big data processing framework depends on efficiencies of scale for both map and reduce phases. While most map tasks are preemptive and parallelizable, the reduce tasks typically are not easily decomposed and often become a bottleneck due to constraints of data locality and task complexity. By assuming that reduce tasks are non-parallelizable, we study offline scheduling of minimizing makespan and minimizing total completion time, respectively. Both preemptive and non-preemptive reduce tasks are considered. On makespan minimization, for preemptive version we design an algorithm and prove its optimality, for non-preemptive version we design an approximation algorithm with the worst ratio of 3/2-1/2h where h is the number of machines. On total complete time minimization, for non-preemptive version we devise an approximation algorithm with worst case ratio of 2-1/h, and for preemptive version we devise a heuristic. We confirm that our algorithms outperform state-of-art schedulers through experiments.
Keywords :
Big Data; approximation theory; data mining; MapReduce-like system; approximation algorithm; big data processing; makespan minimization; offline scheduling; total completion time; Algorithm design and analysis; Approximation algorithms; Approximation methods; Educational institutions; Minimization; Optimal scheduling; Schedules;
Conference_Titel :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
DOI :
10.1109/INFOCOM.2014.6848159