DocumentCode :
2862682
Title :
A Weighted K-Means Clustering Based Co-scheduling Strategy towards Efficient Execution of Scientific Workflows in Collaborative Cloud Environments
Author :
Deng, Kefeng ; Kong, Lingmei ; Song, Junqiang ; Ren, Kaijun ; Yuan, Dong
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2011
fDate :
12-14 Dec. 2011
Firstpage :
547
Lastpage :
554
Abstract :
Due to the advantages of cost-effectiveness, on-demand resource provision and easy for sharing, cloud computing has grown in popularity with research community for deploying scientific applications such as workflows. When such interest continues growing and workflows are widely performed in collaborative cloud environments that consist of a number of data centers, there is an urgent need for exploiting strategies which can place the application data across globally distributed data centers and schedule tasks according to the data layout to reduce both the latency and make span for workflow execution. In this paper, by utilising dependencies among datasets and tasks, we propose an efficient data and task co scheduling strategy that can place input datasets in a load balance way and meanwhile group the mostly related datasets and tasks together. We build a simulation environment on Tianhe supercomputer to evaluate the proposed strategy and run simulations by random and realistic workflows. The results demonstrate that the proposed strategy can effectively improve workflows performance while reducing the total volume of data transfer across data centers.
Keywords :
cloud computing; computer centres; groupware; pattern clustering; resource allocation; scheduling; scientific information systems; workflow management software; Tianhe supercomputer; cloud computing; collaborative cloud environments; cost-effectiveness; data layout; data transfer; globally distributed data centers; load balance way; on-demand resource provision; random workflow; realistic workflow; schedule tasks; scientific workflows; simulation environment; task co scheduling strategy; weighted k-means clustering based co-scheduling strategy; workflow execution; workflows performance; Cloud computing; Clustering algorithms; Collaboration; Computational modeling; Data models; Distributed databases; Runtime; cloud computing; coscheduling; data placement; scientific workflow; task scheduling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4673-0006-3
Type :
conf
DOI :
10.1109/DASC.2011.102
Filename :
6118736
Link To Document :
بازگشت