• DocumentCode
    1981485
  • Title

    Sampling-based garbage collection metadata management scheme for flash-based storage

  • Author

    Debnath, Biplob ; Krishnan, Srinivasan ; Xiao, Weijun ; Lilja, David J. ; Du, David H C

  • Author_Institution
    EMC Corp., Santa Clara, CA, USA
  • fYear
    2011
  • fDate
    23-27 May 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Existing garbage collection algorithms for the flash-based storage use score-based heuristics to select victim blocks for reclaiming free space and wear leveling. The score for a block is estimated using metadata information such as age, block utilization, and erase count. To quickly find a victim block, these algorithms maintain a priority queue in the SRAM of the storage controller. This priority queue takes O(K) space, where K stands for flash storage capacity in total number of blocks. As the flash capacity scales to larger size, K also scales to larger value. However, due to higher price per byte, SRAM will not scale proportionately. In this case, due to SRAM scarcity, it will be challenging to implement a larger priority queue in the limited SRAM of a large-capacity flash storage. In addition to space issue, with any update in the metadata information, the priority queue needs to be continuously updated, which takes O(lg(K)) operations. This computation overhead also increases with the increase of flash capacity. In this paper, we have taken a novel approach to solve the garbage collection metadata management problem of a large-capacity flash storage. We propose a sampling-based approach to approximate existing garbage collection algorithms in the limited SRAM space. Since these algorithms are heuristic-based, our sampling-based algorithm will perform as good as unsampled (original) algorithm, if we choose good samples to make garbage collection decisions. We propose a very simple policy to choose samples. Our experimental results show that small number of samples are good enough to emulate existing garbage collection algorithms.
  • Keywords
    SRAM chips; computational complexity; flash memories; meta data; queueing theory; sampling methods; storage management; wear; SRAM; block utilization; erase count; flash storage capacity; flash-based storage; free space; priority queue; sampling-based garbage collection metadata management scheme; score-based heuristics; storage controller; wear leveling; Aerospace electronics; Approximation algorithms; Approximation methods; Ash; Cleaning; Heuristic algorithms; Random access memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mass Storage Systems and Technologies (MSST), 2011 IEEE 27th Symposium on
  • Conference_Location
    Denver, CO
  • ISSN
    2160-195X
  • Print_ISBN
    978-1-4577-0427-7
  • Electronic_ISBN
    2160-195X
  • Type

    conf

  • DOI
    10.1109/MSST.2011.5937228
  • Filename
    5937228