DocumentCode :
720571
Title :
Implementation and Evaluation of MPI Nonblocking Collective I/O
Author :
Seo, Sangmin ; Latham, Robert ; Zhang, Junchao ; Balaji, Pavan
Author_Institution :
Math. & Comput. Sci. Div., Argonne Nat. Lab., Argonne, IL, USA
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
1084
Lastpage :
1091
Abstract :
The well-known gap between relative CPU speeds and storage bandwidth results in the need for new strategies for managing I/O demands. In large-scale MPI applications, collective I/O has long been an effective way to achieve higher I/O rates, but it poses two constraints. First, although overlapping collective I/O and computation represents the next logical step toward a faster time to solution, MPI's existing collective I/O API provides only limited support for doing so. Second, collective routines (both for I/O and communication) impose a synchronization cost in addition to a communication cost. The upcoming MPI 3.1 standard will provide a new set of nonblocking collective I/O operations to meet the needs of applications. We present here initial work on the implementation of MPI nonblocking collective I/O operations in the MPICH MPI library. Our implementation begins with the extended two-phase algorithm used in ROMIO's collective I/O implementation. We then utilize a state machine and the extended generalized request interface to maintain the progress of nonblocking collective I/O operations. The evaluation results indicate that our implementation performs as well as blocking collective I/O in terms of I/O bandwidth and is capable of overlapping I/O and other operations. We believe that our implementation can help users try nonblocking collective I/O operations in their applications.
Keywords :
application program interfaces; input-output programs; message passing; IO demands; MPI 3.1 standard; MPI nonblocking collective IO; MPICH MPI library; ROMIO collective IO implementation; collective IO API; communication cost; extended generalized request interface; extended two-phase algorithm; large-scale MPI applications; nonblocking collective IO operations; relative CPU speeds; storage bandwidth; synchronization cost; Bandwidth; Benchmark testing; Cloud computing; Libraries; Optimization; Standards; Synchronization; MPI I/O; extended generalized request; nonblocking collective I/O; state machine; two-phase method;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.81
Filename :
7152595