DocumentCode
720554
Title
Joint Scheduling of Data and Computation in Geo-Distributed Cloud Systems
Author
Lingyan Yin ; Jizhou Sun ; Laiping Zhao ; Chenzhou Cui ; Jian Xiao ; Ce Yu
Author_Institution
Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
fYear
2015
fDate
4-7 May 2015
Firstpage
657
Lastpage
666
Abstract
Recent trends show that cloud computing is growing to span more and more globally distributed data centers. For geo-distributed data centers, there is an increasing need for scheduling algorithms to place tasks across data centers, by jointly considering data and computation. This scheduling must deal with situations such as wide-area distributed data, data sharing, WAN bandwidth costs and data center capacity limits, while also minimizing completion time. However, this kind of scheduling problems is known to be NP-Hard. In this paper, inspired by real applications in astronomy field, we propose a two-phase scheduling algorithm that addresses these challenges. The mapping phase groups tasks considering the data-sharing relations, and dispatches groups to data centers by way of one-to-one correspondence. The reassigning phase balances the completion time across data centers according to relations between tasks and groups. We utilize the real China-Astronomy-Cloud model and typical applications to evaluate our proposal. Simulations show that our algorithm obtains up to 22% better completion time and effectively reduces the amount of data transfers compared with other similar scheduling algorithms.
Keywords
astronomy computing; cloud computing; computational complexity; computer centres; scheduling; China-astronomy-cloud model; NP-hard scheduling problems; WAN bandwidth costs; astronomy field; cloud computing; data center capacity limits; data-sharing relations; geo-distributed cloud systems; geo-distributed data centers; joint data-computation scheduling algorithm; mapping phase group tasks; two-phase scheduling algorithm; wide-area distributed data; Astronomy; Computational modeling; Data models; Data transfer; Distributed databases; Scheduling algorithms; cloud computing; data and computation intensive; geo-distributed data centers; scheduling algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location
Shenzhen
Type
conf
DOI
10.1109/CCGrid.2015.83
Filename
7152531
Link To Document