• DocumentCode
    1977982
  • Title

    Designing a robust namespace for distributed file services

  • Author

    Zhang, Zheng ; Karamanolis, Christos

  • Author_Institution
    Labs., Hewlett-Packard Co., Palo Alto, CA, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    162
  • Lastpage
    171
  • Abstract
    A number of ongoing research projects follow a partition-based approach to provide highly scalable distributed storage services. These systems maintain namespaces that reference objects distributed across multiple locations in the system. Typically, atomic commitment protocols, such as 2-phase commit, are used to update the namespace, in order to guarantee its consistency even in the presence of failures. Atomic commitment protocols are known to impose a high overhead on failure-free execution. Furthermore, they use conservative recovery procedures and may considerably restrict the concurrency of overlapping operations in the system. This paper proposes a set of new protocols implementing the fundamental operations in a distributed namespace. The protocols impose a minimal overhead on failure-free execution. They are robust against both communication and host failures, and use aggressive recovery procedures to re-execute incomplete operations. The proposed protocols are compared with their 2-phase commit counterparts and are shown to outperform them in all critical performance factors: communication round-trips, synchronous I/O, and operation concurrency.
  • Keywords
    distributed databases; fault tolerant computing; memory protocols; naming services; 2-phase commit; aggressive recovery procedures; atomic commitment protocols; communication round-trips; distributed file services; operation concurrency; partition-based approach; robust namespace design; synchronous I/O; Concurrent computing; Control systems; Electrical capacitance tomography; File systems; Laboratories; Lapping; Milling machines; Protocols; Robustness; Scalability
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Reliable Distributed Systems, 2001. Proceedings. 20th IEEE Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1060-9857
  • Print_ISBN
    0-7695-1366-2
  • Type

    conf

  • DOI
    10.1109/RELDIS.2001.969770
  • Filename
    969770