Title :
Speculative Execution for a Single Job in a MapReduce-Like System
Author :
Huanle Xu ; Wing Cheong Lau
Author_Institution :
Dept. of Inf. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fDate :
June 27 2014-July 2 2014
Abstract :
Parallel processing plays an important role for large-scale data analytics. It breaks a job into many small tasks which run parallel on multiple machines such as MapReduce framework. One fundamental challenge faced to such parallel processing is the straggling tasks as they can delay the completion of a job seriously. In this paper, we focus on the speculative execution issue which is used to deal with the straggling problem in the literature. We present a theoretical framework for the optimization of a single job which differs a lot from the previous heuristics-based work. More precisely, we propose two schemes when the number of parallel tasks the job consists of is smaller than cluster size. In the first scheme, no monitoring is needed and we can provide the job deadline guarantee with a high probability while achieve the optimal resource consumption level. The second scheme needs to monitor the task progress and makes the optimal number of duplicates when the straggling problem happens. On the other hand, when the number of tasks in a job is larger than the cluster size, we propose an Enhanced Speculative Execution (ESE) algorithm to make the optimal decision whenever a machine is available for a new scheduling. The simulation results show the ESE algorithm can reduce the job flow time by 50% while consume fewer resources comparing to the strategy without backup.
Keywords :
data analysis; optimisation; parallel programming; probability; scheduling; ESE algorithm; MapReduce-like system; enhanced speculative execution algorithm; job flow time reduction; large-scale data analytics; parallel processing; probability; scheduling; single job optimization; Algorithm design and analysis; Clustering algorithms; Equations; Loading; Monitoring; Optimization; Parallel processing; MapReduce; resource optimization; speculative execution; theoretical analysis;
Conference_Titel :
Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5062-1
DOI :
10.1109/CLOUD.2014.84