• DocumentCode
    260438
  • Title

    WORMStore: A Specialized Object Store for Write-Once Read-Many Workloads

  • Author

    Narayanamurthy, Srinivasan ; Muthyala, Kartheek ; Makkar, Guarav

  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    91
  • Lastpage
    99
  • Abstract
    The recent increase in interest for batch analytics has resulted in extensive use of distributed frameworks such as Hadoop and Dryad. Batch analytics-as the name suggests, perform many computations on large volumes of data. That is, large quantities of data are ingested once and read many times mostly in large chunks, which is characterized as write-once read-many (WORM) workload. The storage part of these distributed frameworks (say, HDFS in Hadoop) use file systems such as ext4 or XFS as native object stores to store objects as files in individual nodes of the distributed system. These general purpose file systems were designed with broader goals such as POSIX-compliance, optimal performance for a wide range of file size, user friendliness, etc. However, most of these features are not required for a native object store in distributed file systems. WORM Store is a light weight object store that is designed exclusively for use in distributed systems for WORM workload. WORM Store provides interesting advantages such as the ability to pre-fetch large objects, small metadata to data ratio, media aware data/metadata placement, etc. As WORM Store is log-structured, it provides the ability to recover upon failure. Our experiments show that WORM Store provides a 28% increase in the read throughput per node in a Hadoop cluster.
  • Keywords
    data handling; distributed databases; network operating systems; parallel processing; write-once storage; Dryad; Hadoop; POSIX-compliance; WORM workload; WORMStore; XFS; distributed file systems; ext4; specialized object store; write-once read-many workloads; Computer crashes; Data structures; Grippers; Layout; Media; Performance evaluation; Resource management; data storage; distributed file systems; file systems; native storage; object store; write-once read-many workload;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Modelling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2014 IEEE 22nd International Symposium on
  • Conference_Location
    Paris
  • ISSN
    1526-7539
  • Type

    conf

  • DOI
    10.1109/MASCOTS.2014.19
  • Filename
    7033641