• DocumentCode
    1806487
  • Title

    Data rearrange based on mining block access sequence in Cloud Storage

  • Author

    Du, Hongtao ; Li, Zhanhuai

  • Author_Institution
    Comput. Coll., Northwestern Polytech. Univ., Xi´´An, China
  • Volume
    4
  • fYear
    2011
  • fDate
    24-26 Dec. 2011
  • Firstpage
    2507
  • Lastpage
    2511
  • Abstract
    In Cloud Storage system, storage systems have to service for large numbers of scattered data access nodes, and data I/O almost are in random access form, in addition, the distribution storage of data result in data transmission between the Cloud nodes increases greatly, then the performance of the cloud storage system was be restricted remarkably. In this paper, we put forward a system for improving access performance in Cloud Storage system. Through the access trace, the process which doing disk I/O should be detected. The processes that are executed contemporaneous should be regarded as a group. Then, the block access sequences belong to same process groups could be mined based on the Frequent Sequence Mining. And the block relation could be obtained. Ultimately, the related blocks could be rearranged to the near location. It could reduce the head moving of disk during access and realize the mapping from random access to sequence access partly. Furthermore, the data could be migrated between the storage nodes according to the block relation and reducing the data transfer through network between nodes. We have also evaluated the benefits of the system. Our result using real system workloads show that with the data rearrange and data transfer, about 10%-20% I/O response time could be reduced.
  • Keywords
    cloud computing; data communication; data mining; disc storage; information retrieval; software performance evaluation; storage area networks; storage management; I/O response time; access performance improvement; block access sequence mining; cloud nodes; cloud storage system; data rearange technique; data transfer; data transmission; disk I/O; disk head; distribution storage; frequent sequence mining; random access data I/O; scattered data access nodes; Educational institutions; Reliability; Servers; Cloud Storage; Data Rearrange; Mining Block Access Sequence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Network Technology (ICCSNT), 2011 International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4577-1586-0
  • Type

    conf

  • DOI
    10.1109/ICCSNT.2011.6182479
  • Filename
    6182479