DocumentCode
611043
Title
Hierarchical I/O Scheduling for Collective I/O
Author
Jialin Liu ; Yong Chen ; Yi Zhuang
Author_Institution
Dept. of Comput. Sci., Texas Tech Univ., Lubbock, TX, USA
fYear
2013
fDate
13-16 May 2013
Firstpage
211
Lastpage
218
Abstract
The non-contiguous access pattern of many scientific applications results in a large number of I/O requests, which can seriously limit the data-access performance. Collective I/O has been widely used to address this issue. However, the performance of collective I/O could be dramatically degraded in today´s high-performance computing system due to the increasing shuffle cost caused by highly concurrent data accesses. This situation tends to be even worse as many applications become more and more data intensive. Previous research has primarily focused on optimizing I/O access cost in collective I/O but largely ignored the shuffle cost involved. In this study, we propose a new hierarchical I/O scheduling (HIO) algorithm to address the increasing shuffle cost in collective I/O. The fundamental idea is to schedule applications´ I/O requests based on a shuffle cost analysis to achieve the optimal overall performance, instead of achieving optimal I/O accesses only. The algorithm is currently evaluated with the MPICH2 andPVFS2. Both theoretical analysis and experimental tests show that the proposed hierarchical I/O scheduling has a potential in addressing the degraded performance issue of collective I/O with highly concurrent accesses.
Keywords
concurrency control; data handling; natural sciences computing; parallel processing; scheduling; storage management; I/O access cost optimization; I/O request; MPICH2; PVFS2; collective I/O performance; data intensive applications; data-access performance; hierarchical I/O scheduling; high-performance computing system; highly concurrent data access; noncontiguous access pattern; optimal overall performance; scientific application; shuffle cost analysis; Algorithm design and analysis; Delays; Equations; Scheduling; Scheduling algorithms; Servers; big data; collective I/O; data intensive computing; high-performance computing; scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on
Conference_Location
Delft
Print_ISBN
978-1-4673-6465-2
Type
conf
DOI
10.1109/CCGrid.2013.30
Filename
6546095
Link To Document