DocumentCode
3651943
Title
Insight and reduction of MapReduce stragglers in heterogeneous environment
Author
Xia Zhao; Kai Kang; YuZhong Sun; Yin Song; Minhao Xu; Tao Pan
Author_Institution
Sch. of Comput. &
fYear
2013
Firstpage
1
Lastpage
8
Abstract
Speculative and clone execution are existing techniques to overcome the problems of task stragglers and performance degradation in heterogeneous clusters for big data processing. In this paper, we propose an alternative approach to solving the problems based on analysis results of profiling and the relations of the system parameters. Our approach adjusts the amount of task slots of nodes dynamically to match the processing power of the nodes, according to current task progress rate and resource utilization. It contrasts with the existing techniques by attempting to prevent task stragglers from occurring in the first place through maintaining a balance between resource supply and demand. We have implemented this method in the Hadoop MapReduce platform, and the TPC-H benchmark results show that it achieves 20-30% performance improvement and 35-88% less stragglers than existing techniques.
Keywords
"Fluctuations","Scheduling algorithms"
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Type
conf
DOI
10.1109/CLUSTER.2013.6702673
Filename
6702673
Link To Document