• DocumentCode
    15973
  • Title

    Scale-RS: An Efficient Scaling Scheme for RS-Coded Storage Clusters

  • Author

    Jianzhong Huang ; Xianhai Liang ; Xiao Qin ; Ping Xie ; Changsheng Xie

  • Author_Institution
    Wuhan Nat. Lab. for Optoelectron., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • Volume
    26
  • Issue
    6
  • fYear
    2015
  • fDate
    June 1 2015
  • Firstpage
    1704
  • Lastpage
    1717
  • Abstract
    It is indispensable to scale erasure-coded storage clusters to meet requirements of increased storage capacity and I/O performance. In this study, we propose an efficient scaling scheme for Reed-Solomon-coded storage clusters called Scale-RS, which has three salient features. First, Scale-RS achieves uniform data distribution by equally placing data blocks among old and new chunks using a transposed data layout. Second, Scale-RS minimizes data movement incurred in the procedures of data redistribution and parity update. Scale-RS not only reaches the lower bound of data migration traffic by transferring necessary data blocks from old data chunks to new chunks, but it also reduces update traffic via generating parity difference blocks from data blocks stored in an individual data chunk. Third, Scale-RS improves the I/O performance of scaled storage clusters in terms of read parallelism and write throughput. We implement Scale-RS along with two alternative scaling schemes in a Reed-Solomon-coded storage cluster, on which real-world I/O traces are replayed. Experimental results demonstrate that Scale-RS achieves the highest read performance among the three scaling schemes after data redistribution. When it comes to scaling from six data chunks to nine, Scale-RS can outperform the other two scaling schemes in terms of aggregate write throughput by a factor of 2.85 and 3.05 under online filling and offline filling, respectively. We also show that user response time is slightly enlarged during data redistribution due to bandwidth competition between migration and user I/Os.
  • Keywords
    Reed-Solomon codes; digital storage; I-O performance; RS-coded storage clusters; Reed-Solomon-coded storage clusters; bandwidth competition; data blocks; data chunk; data distribution; data migration traffic; data movement minimization; data redistribution; offline filling; online filling; parity difference blocks; parity update; read parallelism; salient features; scale erasure-coded storage clusters; scale-RS; storage capacity; write throughput; Encoding; Fault tolerance; Layout; Parallel processing; Reed-Solomon codes; Strips; Throughput; Erasure-coded storage cluster; cluster scaling; data redistribution; parity update;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2014.2326156
  • Filename
    6819450