• DocumentCode
    3145694
  • Title

    Metadata Traces and Workload Models for Evaluating Big Storage Systems

  • Author

    Abad, Cristina L. ; Huong Luu ; Roberts, Nick ; Kihwal Lee ; Yi Lu ; Campbell, Roy H.

  • Author_Institution
    Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2012
  • fDate
    5-8 Nov. 2012
  • Firstpage
    125
  • Lastpage
    132
  • Abstract
    Efficient namespace metadata management is increasingly important as next-generation file systems are designed for peta and exascales. New schemes have been proposed, however, their evaluation has been insufficient due to a lack of appropriate namespace metadata traces. Specifically, no Big Data storage system metadata trace is publicly available and existing ones are a poor replacement. We studied publicly available traces and one Big Data trace from Yahoo! and note some of the differences and their implications to metadata management studies. We discuss the insufficiency of existing evaluation approaches and present a first step towards a statistical metadata workload model that can capture the relevant characteristics of a workload and is suitable for synthetic workload generation. We describe Mimesis, a synthetic workload generator, and evaluate its usefulness through a case study in a least recently used metadata cache for the Hadoop Distributed File System. Simulation results show that the traces generated by Mimesis mimic the original workload and can be used in place of the real trace providing accurate results.
  • Keywords
    Internet; cache storage; meta data; public domain software; statistical analysis; Hadoop distributed file system; Yahoo!; big data storage system evaluation; exascale; metadata cache; metadata traces; namespace metadata management; next-generation file systems; petascale; statistical metadata workload model; synthetic workload generation; Benchmark testing; Computational modeling; Data handling; Data storage systems; Information management; Servers; Shape; Big Data; HDFS; MDS; metadata; storage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Utility and Cloud Computing (UCC), 2012 IEEE Fifth International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    978-1-4673-4432-6
  • Type

    conf

  • DOI
    10.1109/UCC.2012.27
  • Filename
    6424937