Title :
Shared I/O Scheduling in Cloud for Structured Data Processing
Author :
Baoquan Zhang ; Jingmei Li ; Tao Xu ; Dongsheng Wang ; Nan Ding
Author_Institution :
Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin, China
Abstract :
Cloud plays an important role in the structure data processing because of its high I/O throughput and excellent capability of computing. At present, since the importance of structure data processing has been greater than before, cloud has confront with a higher pressure of data processing requirement. Massive tasks of data query and analysis, which send innumerable data I/O requests, are running on cloud, bringing unprecedented problems on I/O scheduling. In this paper, a kind of shared I/O scheduling method is proposed. Firstly, during the scheduling, the difference of performance among nodes has been considered. A mathematic model of finish time predication is established to estimate the time cost of each node to finish the tasks. And the requests will be finally assigned to the nodes which have lowest time cost. Meanwhile, to further save the I/O resources, a shared I/O mechanism which merges the requests to same table to a shared request has been raised. By using the shared I/O mechanism, the number of the requests can be prominently decreased and the performance of concurrent data queries can also be improved since repetitive read has been avoided. In the end of the paper, we evaluate the performance of the method by several experiments. The results indicate that the shared I/O scheduling method can effectively save the I/O resources and improve the performance of data processing, having wide range of potential applications.
Keywords :
cloud computing; data analysis; scheduling; concurrent data queries; data analysis; data query; finish time predication; mathematic model; shared I/O scheduling method; structure data processing; Data processing; Databases; Hard disks; Mathematical model; Merging; Throughput; I/O schedule; cloud computing; shared request; structured data;
Conference_Titel :
Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/BDCloud.2014.53