Title :
Designing a robust namespace for distributed file services
Author :
Zhang, Zheng ; Karamanolis, Christos
Author_Institution :
Labs., Hewlett-Packard Co., Palo Alto, CA, USA
Abstract :
A number of ongoing research projects follow a partition-based approach to provide highly scalable distributed storage services. These systems maintain namespaces that reference objects distributed across multiple locations in the system. Typically, atomic commitment protocols, such as 2-phase commit, are used to update the namespace so that it remains consistent even in the presence of failures. Atomic commitment protocols are known to impose a high overhead on failure-free execution. Furthermore, they use conservative recovery procedures and may considerably restrict the concurrency of overlapping operations in the system. This paper proposes a set of new protocols implementing the fundamental operations in a distributed namespace. The protocols impose a minimal overhead on failure-free execution. They are robust against both communication and host failures, and use aggressive recovery procedures to re-execute incomplete operations. The proposed protocols are compared with their 2-phase commit counterparts and are shown to outperform them in all critical performance factors: communication round-trips, synchronous I/O, and operation concurrency.
Keywords :
distributed databases; fault tolerant computing; memory protocols; naming services; 2-phase commit; aggressive recovery procedures; atomic commitment protocols; communication round-trips; distributed file services; operation concurrency; partition-based approach; robust namespace design; synchronous I/O; Concurrent computing; Control systems; Electrical capacitance tomography; File systems; Laboratories; Lapping; Milling machines; Protocols; Robustness; Scalability
Conference_Title :
Reliable Distributed Systems, 2001. Proceedings. 20th IEEE Symposium on
Conference_Location :
New Orleans, LA
Print_ISBN :
0-7695-1366-2
DOI :
10.1109/RELDIS.2001.969770