DocumentCode :
720554
Title :
Joint Scheduling of Data and Computation in Geo-Distributed Cloud Systems
Author :
Lingyan Yin ; Jizhou Sun ; Laiping Zhao ; Chenzhou Cui ; Jian Xiao ; Ce Yu
Author_Institution :
Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
657
Lastpage :
666
Abstract :
Recent trends show that cloud computing is growing to span more and more globally distributed data centers. For geo-distributed data centers, there is an increasing need for scheduling algorithms to place tasks across data centers, by jointly considering data and computation. This scheduling must deal with situations such as wide-area distributed data, data sharing, WAN bandwidth costs and data center capacity limits, while also minimizing completion time. However, this kind of scheduling problems is known to be NP-Hard. In this paper, inspired by real applications in astronomy field, we propose a two-phase scheduling algorithm that addresses these challenges. The mapping phase groups tasks considering the data-sharing relations, and dispatches groups to data centers by way of one-to-one correspondence. The reassigning phase balances the completion time across data centers according to relations between tasks and groups. We utilize the real China-Astronomy-Cloud model and typical applications to evaluate our proposal. Simulations show that our algorithm obtains up to 22% better completion time and effectively reduces the amount of data transfers compared with other similar scheduling algorithms.
Keywords :
astronomy computing; cloud computing; computational complexity; computer centres; scheduling; China-astronomy-cloud model; NP-hard scheduling problems; WAN bandwidth costs; astronomy field; cloud computing; data center capacity limits; data-sharing relations; geo-distributed cloud systems; geo-distributed data centers; joint data-computation scheduling algorithm; mapping phase group tasks; two-phase scheduling algorithm; wide-area distributed data; Astronomy; Computational modeling; Data models; Data transfer; Distributed databases; Scheduling algorithms; cloud computing; data and computation intensive; geo-distributed data centers; scheduling algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.83
Filename :
7152531
Link To Document :
بازگشت