• DocumentCode
    2013462
  • Title

    Joint scheduling of processing and Shuffle phases in MapReduce systems

  • Author

    Chen, Fangfei ; Kodialam, Murali ; Lakshman, T.V.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Penn State Univ., University Park, PA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    1143
  • Lastpage
    1151
  • Abstract
    MapReduce has emerged as an important paradigm for processing data in large data centers. MapReduce is a three phase algorithm comprising of Map, Shuffle and Reduce phases. Due to its widespread deployment, there have been several recent papers outlining practical schemes to improve the performance of MapReduce systems. All these efforts focus on one of the three phases to obtain performance improvement. In this paper, we consider the problem of jointly scheduling all three phases of the MapReduce process with a view of understanding the theoretical complexity of the joint scheduling and working towards practical heuristics for scheduling the tasks. We give guaranteed approximation algorithms and outline several heuristics to solve the joint scheduling problem.
  • Keywords
    approximation theory; computational complexity; computer centres; data handling; scheduling; software performance evaluation; MapReduce systems; Shuffle phases joint scheduling; approximation algorithms; data centers; data processing; map phases; performance improvement; processing joint scheduling; reduce phases; tasks scheduling; Approximation algorithms; Approximation methods; Copper; Linear programming; Processor scheduling; Program processors; TV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2012 Proceedings IEEE
  • Conference_Location
    Orlando, FL
  • ISSN
    0743-166X
  • Print_ISBN
    978-1-4673-0773-4
  • Type

    conf

  • DOI
    10.1109/INFCOM.2012.6195473
  • Filename
    6195473