• DocumentCode
    3651943
  • Title

    Insight and reduction of MapReduce stragglers in heterogeneous environment

  • Author

    Xia Zhao; Kai Kang; YuZhong Sun; Yin Song; Minhao Xu; Tao Pan

  • Author_Institution
    Sch. of Comput. &
  • fYear
    2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Speculative and clone execution are existing techniques to overcome the problems of task stragglers and performance degradation in heterogeneous clusters for big data processing. In this paper, we propose an alternative approach to solving the problems based on analysis results of profiling and the relations of the system parameters. Our approach adjusts the amount of task slots of nodes dynamically to match the processing power of the nodes, according to current task progress rate and resource utilization. It contrasts with the existing techniques by attempting to prevent task stragglers from occurring in the first place through maintaining a balance between resource supply and demand. We have implemented this method in the Hadoop MapReduce platform, and the TPC-H benchmark results show that it achieves 20-30% performance improvement and 35-88% less stragglers than existing techniques.
  • Keywords
    "Fluctuations","Scheduling algorithms"
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2013 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2013.6702673
  • Filename
    6702673