DocumentCode :
2875778
Title :
HMF: High-available Message-passing Framework for Cluster File System
Author :
Yang, Dong ; Chen, Zhuan ; Tang, Rongfeng ; Xiong, Jin ; Meng, Dan
Author_Institution :
Inst. of Comput. Technol., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
fYear :
2009
fDate :
9-11 July 2009
Firstpage :
249
Lastpage :
252
Abstract :
In large-scale cluster systems, the failure rate of network connection is non-negligibly high. A cluster file system must have the ability to handle network failures in order to provide high-available data accesses service. Traditionally, network failure handling is only guaranteed by network protocol, or implemented within the file system semantic layer. We present the high-available message-passing framework which is called HMF. Based on the operation hierarchy in cluster file system, HMF guarantees the availability of each pair of network transmissions and their interaction with the file system sub-operations. It separates the network fault-tolerance design from the file system and keeps a simple interface between them. HMF could handle a lot of network failures internally, which greatly simplifies the implementation of file system semantic layer. Performance results show that HMF can increase the availability of message passing and reduce the cost of recovery from network failures. When there are two network channels, HMF also improves aggregate I/O bandwidth by 80% in normal condition while the performance degradation due to recovery is below 10%.
Keywords :
fault tolerance; file organisation; file servers; message passing; transport protocols; cluster file system; high-available message-passing; network connection; network failure handling; network fault-tolerance design; network protocol; Access protocols; Aggregates; Availability; Bandwidth; Costs; Degradation; Fault tolerant systems; File systems; Large-scale systems; Message passing; cluster file system; high availability; message passing layer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networking, Architecture, and Storage, 2009. NAS 2009. IEEE International Conference on
Conference_Location :
Hunan
Print_ISBN :
978-0-7695-3741-2
Type :
conf
DOI :
10.1109/NAS.2009.47
Filename :
5197333
Link To Document :
بازگشت