• DocumentCode
    170791
  • Title

    Joint scheduling of MapReduce jobs with servers: Performance bounds and experiments

  • Author

    Yi Yuan ; Dan Wang ; Jiangchuan Liu

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., Hong Kong, China
  • fYear
    2014
  • fDate
    April 27 2014-May 2 2014
  • Firstpage
    2175
  • Lastpage
    2183
  • Abstract
    MapReduce has achieved tremendous success for large-scale data processing in data centers. A key feature distinguishing MapReduce from previous parallel models is that it interleaves parallel and sequential computation. Past schemes, and especially their theoretical bounds, on general parallel models are therefore, unlikely to be applied to MapReduce directly. There are many recent studies on MapReduce job and task scheduling. These studies assume that the servers are assigned in advance. In current data centers, multiple MapReduce jobs of different importance levels run together. In this paper, we investigate a schedule problem for MapReduce taking server assignment into consideration as well. We formulate a MapReduce server-job organizer problem (MSJO) and show that it is NP-complete. We develop a 3-approximation algorithm and a fast heuristic. We evaluate our algorithms through both simulations and experiments on Amazon EC2 with an implementation in Hadoop. The results confirm the advantage of our algorithms.
  • Keywords
    approximation theory; computational complexity; parallel programming; scheduling; 3-approximation algorithm; Amazon EC2; Hadoop; MSJO; MapReduce server-job organizer problem; NP-complete problem; data centers; fast heuristic; job scheduling; joint scheduling; large-scale data processing; parallel computation; parallel models; sequential computation; server assignment; task scheduling; Complexity theory; Delays; Mars; Processor scheduling; Schedules; Scheduling; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2014 Proceedings IEEE
  • Conference_Location
    Toronto, ON
  • Type

    conf

  • DOI
    10.1109/INFOCOM.2014.6848160
  • Filename
    6848160