• DocumentCode
    720554
  • Title
    Joint Scheduling of Data and Computation in Geo-Distributed Cloud Systems
  • Author
    Lingyan Yin ; Jizhou Sun ; Laiping Zhao ; Chenzhou Cui ; Jian Xiao ; Ce Yu
  • Author_Institution
    Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
  • fYear
    2015
  • fDate
    4-7 May 2015
  • Firstpage
    657
  • Lastpage
    666
  • Abstract
    Recent trends show that cloud computing is growing to span more and more globally distributed data centers. For geo-distributed data centers, there is an increasing need for scheduling algorithms that place tasks across data centers by jointly considering data and computation. Such scheduling must deal with wide-area distributed data, data sharing, WAN bandwidth costs, and data center capacity limits, while also minimizing completion time. However, this kind of scheduling problem is known to be NP-hard. In this paper, inspired by real applications in the astronomy field, we propose a two-phase scheduling algorithm that addresses these challenges. The mapping phase groups tasks according to their data-sharing relations and dispatches the groups to data centers through a one-to-one correspondence. The reassigning phase balances completion time across data centers according to the relations between tasks and groups. We use the real China-Astronomy-Cloud model and typical applications to evaluate our proposal. Simulations show that our algorithm achieves up to 22% better completion time and effectively reduces the amount of data transferred compared with other similar scheduling algorithms.
  • Keywords
    astronomy computing; cloud computing; computational complexity; computer centres; scheduling; China-astronomy-cloud model; NP-hard scheduling problems; WAN bandwidth costs; astronomy field; data center capacity limits; data-sharing relations; geo-distributed cloud systems; geo-distributed data centers; joint data-computation scheduling algorithm; mapping phase group tasks; two-phase scheduling algorithm; wide-area distributed data; Astronomy; Computational modeling; Data models; Data transfer; Distributed databases; Scheduling algorithms; data and computation intensive
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
  • Conference_Location
    Shenzhen
  • Type
    conf
  • DOI
    10.1109/CCGrid.2015.83
  • Filename
    7152531