DocumentCode :
1878100
Title :
SCORPIO: A scalable two-phase parallel I/O library with application to a large scale subsurface simulator
Author :
Sreepathi, Sarat ; Sripathi, Vamsi ; Mills, Richard ; Hammond, Glenn ; Mahinthakumar, G. Kumar
Author_Institution :
Oak Ridge Nat. Lab., Oak Ridge, TN, USA
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
443
Lastpage :
451
Abstract :
Inefficient parallel I/O is known to be a major bottleneck among scientific applications employed on supercomputers as the number of processor cores grows into the thousands. Our prior experience indicated that parallel I/O libraries such as HDF5 that rely on MPI-IO do not scale well beyond 10K processor cores, especially on parallel file systems (like Lustre) with a single point of resource contention. Our previous optimization efforts for a massively parallel multi-phase and multi-component subsurface simulator (PFLOTRAN) led to a two-phase I/O approach at the application level, where a set of designated processes participate in the I/O process by splitting the I/O operation into a communication phase and a disk I/O phase. The designated I/O processes are created by splitting the MPI global communicator into multiple sub-communicators. The root process in each sub-communicator is responsible for performing the I/O operations for the entire group and then distributing the data to the rest of the group. This approach resulted in over 25X speedup in HDF I/O read performance and 3X speedup in write performance for PFLOTRAN at over 100K processor cores on the ORNL Jaguar supercomputer. This research describes the design and development of a general purpose parallel I/O library called Scorpio that incorporates our optimized two-phase I/O approach. The library provides a simplified higher level abstraction to the user, sitting atop existing parallel I/O libraries (such as HDF5), and implements optimized I/O access patterns that can scale to a larger number of processors. Performance results with standard benchmark problems and PFLOTRAN indicate that our library is able to maintain the same speedups as before, with the added flexibility of being applicable to a wider range of I/O intensive applications.
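The two-phase read described in the abstract can be illustrated with a small sketch. This is not Scorpio's API or the authors' implementation: it is a pure-Python simulation that uses no MPI at all, and every name in it (`split_into_groups`, `block_range`, `two_phase_read`, `group_size`) is hypothetical. It assumes an even block decomposition of a one-dimensional dataset and assumes that each group is formed from consecutive ranks, which is roughly what an `MPI_Comm_split` call with `color = rank // group_size` would produce in a real MPI program.

```python
# Pure-Python simulation of a two-phase read; no MPI is involved.
# All function and parameter names are hypothetical, for illustration only.

def split_into_groups(world_size, group_size):
    """Mimic splitting a global communicator: color = rank // group_size."""
    groups = {}
    for rank in range(world_size):
        color = rank // group_size
        groups.setdefault(color, []).append(rank)
    return groups


def block_range(rank, world_size, n_items):
    """Half-open [start, stop) of the items a rank owns in an even block split."""
    base, extra = divmod(n_items, world_size)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return start, stop


def two_phase_read(dataset, world_size, group_size):
    """Return (data received per rank, list of ranks that touched the 'file')."""
    groups = split_into_groups(world_size, group_size)
    received = {}
    readers = []
    for members in groups.values():
        root = members[0]
        # Disk I/O phase: only the group root reads, in one contiguous request
        # covering the blocks of every member of its group.
        lo, _ = block_range(members[0], world_size, len(dataset))
        _, hi = block_range(members[-1], world_size, len(dataset))
        buffer = dataset[lo:hi]
        readers.append(root)
        # Communication phase: the root hands each member its own slice.
        for rank in members:
            start, stop = block_range(rank, world_size, len(dataset))
            received[rank] = buffer[start - lo:stop - lo]
    return received, readers


if __name__ == "__main__":
    received, readers = two_phase_read(list(range(20)), world_size=8, group_size=4)
    print("ranks that read from disk:", readers)
    print("data held by rank 5:", received[5])
```

In this simulation only one rank per group issues a read, so the number of processes contending for the file system drops from `world_size` to roughly `world_size / group_size`, which is the effect the abstract attributes to the designated I/O processes.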
Keywords :
input-output programs; mainframes; message passing; parallel programming; software libraries; HDF input-output read performance; HDF5; Lustre; MPI global communicator splitting; MPI-IO; ORNL Jaguar supercomputer; Scorpio; disk input-output phase; input-output operation; large scale subsurface simulator; multicomponent subsurface simulator PFLOTRAN; optimized input-output access patterns; parallel file systems; parallel multiphase subsurface simulator; processor cores; resource contention; scalable block-oriented parallel I/O; scalable two-phase parallel input-output library; simplified higher level abstraction; subcommunicators; write performance; Computational modeling; Laboratories; Libraries; Object oriented modeling; Optimization; Scalability; Supercomputers; High Performance Computing; Parallel I/O; Subsurface modeling;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
High Performance Computing (HiPC), 2013 20th International Conference on
Conference_Location :
Bangalore
Type :
conf
DOI :
10.1109/HiPC.2013.6799128
Filename :
6799128