DocumentCode :
2092021
Title :
Enhancing Checkpoint Performance with Staging IO and SSD
Author :
Ouyang, Xiangyong ; Marcarelli, Sonya ; Panda, Dhabaleswar K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2010
fDate :
3-3 May 2010
Firstpage :
13
Lastpage :
20
Abstract :
With the ever-growing size of computer clusters and applications, system failures are becoming inevitable. Checkpointing, a strategy to ensure fault tolerance, has become imperative in such an environment. However existing mechanism of checkpoint writing to parallel systems doesn´t perform well with increasing job size. Solid State Disk(SSD) is attracting more and more attention due to its technical merits such as good random access performance, low power consumption and shock resistance. However, how to apply SSDs into a parallel storage system to improve checkpoint writing still remains an open question. In this paper we propose a new strategy to enhance checkpoint writing performance by aggregating checkpoint writing at client side, and utilizing staging IO on data servers. We also explore the potentials to substitute traditional hard disks with SSDs on data server to achieve better write bandwidth. Our strategy achieves up to 6.3 times higher write bandwidth than a popular parallel file system PVFS2 with 8 client nodes and 4 data servers. In experiments with real applications using 64 application processes and 4 data servers, our strategy can accelerate checkpoint writing by up to 9.9 times compared to PVFS2.
Keywords :
checkpointing; client-server systems; disc storage; SSD; checkpoint writing performance; data server; fault tolerance; parallel storage system; power consumption; random access performance; shock resistance; solid state disk; staging IO; Bandwidth; Checkpointing; Instruction sets; Libraries; Message systems; Servers; Writing; Checkpoint; IO Aggregation; SSD;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Storage Network Architecture and Parallel I/Os (SNAPI), 2010 International Workshop on
Conference_Location :
Incline Village, NV
Print_ISBN :
978-1-4244-6810-2
Electronic_ISBN :
978-1-4244-6811-9
Type :
conf
DOI :
10.1109/SNAPI.2010.10
Filename :
5572849
Link To Document :
بازگشت