• DocumentCode
    3331400
  • Title

    Supporting computational data model representation with high-performance I/O in parallel netCDF

  • Author

    Gao, Kui ; Jin, Chen ; Choudhary, Alok ; Liao, Wei-keng

  • fYear
    2011
  • fDate
    18-21 Dec. 2011
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Parallel computational scientific applications have been described by their computation and communication patterns. From a storage and I/O perspective, these applications can also be grouped into separate data models based on the way data is organized and accessed during simulation, analysis, and visualization. Parallel netCDF is a popular library used in many scientific applications to store scientific datasets and provides high-performance parallel I/O. Although the metadata-rich netCDF file format can effectively store and describe regular multi-dimensional array datasets, it does not address the full range of current and future computational science data models. In this paper, we present a new storage scheme in Parallel netCDF to represent a broad variety of data models used in modern computational scientific applications. This scheme also allows concurrent metadata construction for different data objects from multiple groups of application processes, an important feature in obtaining a high degree of I/O parallelism for data models exhibiting irregular data distribution. Furthermore, we employ non-blocking I/O functions to aggregate irregularly distributed data requests into large, contiguous data requests, to achieve high-performance I/O. Using an example of adaptive mesh refinement data model, we demonstrate the proposed scheme can produce scalable performance results for both data and metadata creation and access.
  • Keywords
    data structures; input-output programs; meta data; parallel processing; computational data model representation; high-performance I/O; metadata construction; metadata-rich netCDF file format; multidimensional array datasets; netCDF file format; parallel computational scientific applications; parallel netCDF; pattern communication; Arrays; Collaboration; Data models; Distributed databases; Libraries; Optimization; Parallel processing; Data Model; Parallel I/O; Parallel netCDF;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2011 18th International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4577-1951-6
  • Electronic_ISBN
    978-1-4577-1949-3
  • Type

    conf

  • DOI
    10.1109/HiPC.2011.6152746
  • Filename
    6152746