DocumentCode :
3717177
Title :
TPS: A task placement strategy for big data workflows
Author :
Mahdi Ebrahimi;Aravind Mohan;Shiyong Lu;Robert Reynolds
Author_Institution :
Wayne State University Detroit, U.S.A.
fYear :
2015
Firstpage :
523
Lastpage :
530
Abstract :
Workflow makespan is the total execution time for running a workflow in the Cloud. The workflow makespan significantly depends on how the workflow tasks and datasets are allocated and placed in a distributed computing environment such as Clouds. Incorporating data and task allocation strategies to minimize makespan delivers significant benefits to scientific users in receiving their results in time. The main goal of a task placement algorithm is to minimize the total amount of data movement between virtual machines during the execution of the workflows. In this paper, we do the following: 1) formalize the task placement problem in big data workflows; 2) propose a task placement strategy (TPS) that considers both initial input datasets and intermediate datasets to calculate the dependency between workflow tasks; and 3) perform extensive experiments in the distributed environment to demonstrate that the proposed strategy provides an effective task distribution and placement tool.
Keywords :
"Virtual machining","Cloud computing","Big data","Computational modeling","Data transfer","Evolutionary computation","Genetic algorithms"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7363795
Filename :
7363795
Link To Document :
بازگشت