DocumentCode
2013462
Title
Joint scheduling of processing and Shuffle phases in MapReduce systems
Author
Chen, Fangfei ; Kodialam, Murali ; Lakshman, T.V.
Author_Institution
Dept. of Comput. Sci. & Eng., Penn State Univ., University Park, PA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
1143
Lastpage
1151
Abstract
MapReduce has emerged as an important paradigm for processing data in large data centers. MapReduce is a three phase algorithm comprising of Map, Shuffle and Reduce phases. Due to its widespread deployment, there have been several recent papers outlining practical schemes to improve the performance of MapReduce systems. All these efforts focus on one of the three phases to obtain performance improvement. In this paper, we consider the problem of jointly scheduling all three phases of the MapReduce process with a view of understanding the theoretical complexity of the joint scheduling and working towards practical heuristics for scheduling the tasks. We give guaranteed approximation algorithms and outline several heuristics to solve the joint scheduling problem.
Keywords
approximation theory; computational complexity; computer centres; data handling; scheduling; software performance evaluation; MapReduce systems; Shuffle phases joint scheduling; approximation algorithms; data centers; data processing; map phases; performance improvement; processing joint scheduling; reduce phases; tasks scheduling; Approximation algorithms; Approximation methods; Copper; Linear programming; Processor scheduling; Program processors; TV;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2012 Proceedings IEEE
Conference_Location
Orlando, FL
ISSN
0743-166X
Print_ISBN
978-1-4673-0773-4
Type
conf
DOI
10.1109/INFCOM.2012.6195473
Filename
6195473
Link To Document