DocumentCode
2427869
Title
Cyclic Workflow Execution Mechanism on Top of MapReduce Framework
Author
Wu, Rong ; Shuai, Liang ; Liao, Huaming
Author_Institution
Inst. of Comput. Technol., Beijing, China
fYear
2011
fDate
24-26 Oct. 2011
Firstpage
28
Lastpage
35
Abstract
MapReduce programming model has been used in various kinds of intensive data processing and analysis projects for its ease of use and good scalability. In this paper, we discuss about the execution mechanism of cyclic workflow on top of MapReduce framework. A novel cycle elimination algorithm is proposed to decompose the cyclic workflow to DAG (Directed Acyclic Graph) sub-workflows. It dynamically and recursively searches for the maximum DAG sub-workflow according to current decision result of the decision node in each iteration. DAG sub-workflow scheduling strategy, which is comprised of DAG grouping mechanism and MapReduce task mapping, is also presented. Finally, we propose an intermediate data transmission mechanism named Partition Pushing, which can improve the possible parallelism between the executions of dependent jobs. Experiments show that our proposed workflow execution mechanism can schedule the cyclic workflow efficiently by improving the parallelism between dependent jobs and consequently reduce the workflow make span by 20%-60%.
Keywords
data analysis; directed graphs; distributed processing; scheduling; DAG grouping mechanism; DAG sub-workflow scheduling strategy; MapReduce framework; MapReduce programming model; MapReduce task mapping; cycle elimination algorithm; cyclic workflow execution mechanism; data analysis; data processing; data transmission mechanism; directed acyclic graph; partition pushing; Data communication; Data processing; Heuristic algorithms; Instruction sets; Processor scheduling; Schedules; Cyclic Workflow; Hadoop; Iterative Computing; MapReduce; Workflow Schedule;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantics Knowledge and Grid (SKG), 2011 Seventh International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4577-1323-1
Type
conf
DOI
10.1109/SKG.2011.46
Filename
6088088
Link To Document