• DocumentCode
    1915148
  • Title
    A Plugin for HDF5 Using PLFS for Improved I/O Performance and Semantic Analysis
  • Author
    Mehta, Karan ; Bent, John ; Torres, Abel ; Grider, Gary ; Gabriel, Edgar
  • Author_Institution
    Dept. of Comput. Sci., Univ. of Houston, Houston, TX, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    746
  • Lastpage
    752
  • Abstract
    HDF5 is a data model, library, and file format for storing and managing data. It is designed for flexible and efficient I/O on high-volume, complex data. Natively, it uses a single-file format in which multiple HDF5 objects are stored in one file. In a parallel HDF5 application, multiple processes access that single file, which creates an I/O performance bottleneck. Additionally, a single-file format does not allow semantic post-processing of individual objects outside the scope of the HDF5 application. We have developed a new plugin for HDF5, built on its Virtual Object Layer, that serves two purposes: 1) it uses PLFS to convert the single-file layout into a data layout that is optimized for the underlying file system, and 2) it stores data in a unique way that enables semantic post-processing of the data. We measure the performance of the plugin and discuss work that leverages the semantic post-processing functionality it enables. We further discuss the applicability of this approach to exascale burst buffer storage systems.
  • Keywords
    data handling; file organisation; parallel processing; HDF5 plugin; PLFS; data layout; data management; exascale burst buffer storage system; hierarchical data format; input-output performance; parallel HDF5 application; parallel log-structured file system; semantic data post-processing; single-file format; virtual object layer; HDF5; PLFS; Parallel I/O; Semantic Analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type
    conf
  • DOI
    10.1109/SC.Companion.2012.102
  • Filename
    6495884