• DocumentCode
    262301
  • Title

    Shared I/O Scheduling in Cloud for Structured Data Processing

  • Author

    Baoquan Zhang ; Jingmei Li ; Tao Xu ; Dongsheng Wang ; Nan Ding

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin, China
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    159
  • Lastpage
    166
  • Abstract
    Cloud plays an important role in the structure data processing because of its high I/O throughput and excellent capability of computing. At present, since the importance of structure data processing has been greater than before, cloud has confront with a higher pressure of data processing requirement. Massive tasks of data query and analysis, which send innumerable data I/O requests, are running on cloud, bringing unprecedented problems on I/O scheduling. In this paper, a kind of shared I/O scheduling method is proposed. Firstly, during the scheduling, the difference of performance among nodes has been considered. A mathematic model of finish time predication is established to estimate the time cost of each node to finish the tasks. And the requests will be finally assigned to the nodes which have lowest time cost. Meanwhile, to further save the I/O resources, a shared I/O mechanism which merges the requests to same table to a shared request has been raised. By using the shared I/O mechanism, the number of the requests can be prominently decreased and the performance of concurrent data queries can also be improved since repetitive read has been avoided. In the end of the paper, we evaluate the performance of the method by several experiments. The results indicate that the shared I/O scheduling method can effectively save the I/O resources and improve the performance of data processing, having wide range of potential applications.
  • Keywords
    cloud computing; data analysis; scheduling; concurrent data queries; data analysis; data query; finish time predication; mathematic model; shared I/O scheduling method; structure data processing; Data processing; Databases; Hard disks; Mathematical model; Merging; Throughput; I/O schedule; cloud computing; shared request; structured data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
  • Conference_Location
    Sydney, NSW
  • Type

    conf

  • DOI
    10.1109/BDCloud.2014.53
  • Filename
    7034781