DocumentCode
692908
Title
Characterization and modeling of PIDX parallel I/O for performance optimization
Author
Kumar, Sudhakar ; Saha, Ankita ; Vishwanath, Venkatram ; Carns, Philip ; Schmidt, John A. ; Scorzelli, Giorgio ; Kolla, Hemanth ; Grout, Ray ; Latham, Rob ; Ross, Robert ; Papka, Michael E. ; Chen, Jiann-Jong ; Pascucci, V.
Author_Institution
Sci. Comput. & Imaging Inst., Univ. of Utah, Salt Lake City, UT, USA
fYear
2013
fDate
17-22 Nov. 2013
Firstpage
1
Lastpage
12
Abstract
Parallel I/O library performance can vary greatly in response to user-tunable parameter values such as aggregator count, file count, and aggregation strategy. Unfortunately, manual selection of these values is time consuming and dependent on characteristics of the target machine, the underlying file system, and the dataset itself. Some characteristics, such as the amount of memory per core, can also impose hard constraints on the range of viable parameter values. In this work we address these problems by using machine learning techniques to model the performance of the PIDX parallel I/O library and select appropriate tunable parameter values. We characterize both the network and I/O phases of PIDX on a Cray XE6 as well as an IBM Blue Gene/P system. We use the results of this study to develop a machine learning model for parameter space exploration and performance prediction.
Keywords
input-output programs; learning (artificial intelligence); parallel processing; software libraries; software performance evaluation; Cray XE6; I/O phases; IBM Blue Gene/P system; PIDX parallel I/O characterization; PIDX parallel I/O modeling; aggregation strategy; aggregator count; file count; file system; hard constraints; machine learning techniques; network phases; parallel I/O library performance prediction; parameter space exploration; performance optimization; target machine; user-tunable parameter value selection; Adaptation models; Data visualization; Laboratories; Libraries; Memory management; Predictive models; Throughput; I/O & Network Characterization; Performance Modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
Conference_Location
Denver, CO
Print_ISBN
978-1-4503-2378-9
Type
conf
DOI
10.1145/2503210.2503252
Filename
6877500
Link To Document