• DocumentCode
    692908
  • Title

    Characterization and modeling of PIDX parallel I/O for performance optimization

  • Author

    Kumar, Sudhakar ; Saha, Ankita ; Vishwanath, Venkatram ; Carns, Philip ; Schmidt, John A. ; Scorzelli, Giorgio ; Kolla, Hemanth ; Grout, Ray ; Latham, Rob ; Ross, Robert ; Papka, Michael E. ; Chen, Jiann-Jong ; Pascucci, V.

  • Author_Institution
    Sci. Comput. & Imaging Inst., Univ. of Utah, Salt Lake City, UT, USA
  • fYear
    2013
  • fDate
    17-22 Nov. 2013
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    Parallel I/O library performance can vary greatly in response to user-tunable parameter values such as aggregator count, file count, and aggregation strategy. Unfortunately, manual selection of these values is time consuming and dependent on characteristics of the target machine, the underlying file system, and the dataset itself. Some characteristics, such as the amount of memory per core, can also impose hard constraints on the range of viable parameter values. In this work we address these problems by using machine learning techniques to model the performance of the PIDX parallel I/O library and select appropriate tunable parameter values. We characterize both the network and I/O phases of PIDX on a Cray XE6 as well as an IBM Blue Gene/P system. We use the results of this study to develop a machine learning model for parameter space exploration and performance prediction.
  • Keywords
    input-output programs; learning (artificial intelligence); parallel processing; software libraries; software performance evaluation; Cray XE6; I/O phases; IBM Blue Gene/P system; PIDX parallel I/O characterization; PIDX parallel I/O modeling; aggregation strategy; aggregator count; file count; file system; hard constraints; machine learning techniques; network phases; parallel I/O library performance prediction; parameter space exploration; performance optimization; target machine; user-tunable parameter value selection; Adaptation models; Data visualization; Laboratories; Libraries; Memory management; Predictive models; Throughput; I/O & Network Characterization; Performance Modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
  • Conference_Location
    Denver, CO
  • Print_ISBN
    978-1-4503-2378-9
  • Type

    conf

  • DOI
    10.1145/2503210.2503252
  • Filename
    6877500