• DocumentCode
    1922302
  • Title

    Perfect Failure Detection in the Partitioned Synchronous Distributed System Model

  • Author

    De Araujo Macedo, Raimundo Jose ; Gorender, Sergio

  • Author_Institution
    Comput. Sci. Dept., Fed. Univ. of Bahia, Salvador
  • fYear
    2009
  • fDate
    16-19 March 2009
  • Firstpage
    273
  • Lastpage
    280
  • Abstract
    In this paper we show that it is possible to implement a perfect failure detector P (one that detects all faulty processes if and only if those processes failed) in a non-synchronous distributed system. To realize that, we introduce the partitioned synchronous system (Spa) that is weaker than the conventional synchronous system. From some properties we introduce (such as strong partitioned synchrony) that must be valid in Spa and a trivially implementable timeliness oracle, we show how to implement P in Spa. Moreover, we show that even if strong partitioned synchrony is not valid, we are still able to take advantage of the existing synchronous partitions for improving the robustness of applications, by introducing a partially perfect failure detector named xP. We also disscus show applications can benefit from these failure detectors and present some related experimental data. The necessary properties and algorithms for implementing P and xP are presented in the paper, as well as the related correctness proofs.
  • Keywords
    distributed algorithms; fault tolerance; graph theory; DS graph; correctness proof; fault-tolerant problem; nonsynchronous distributed system; partitioned synchronous distributed system model; perfect failure detection algorithm; timeliness oracle; Availability; Computer crashes; Computer security; Context modeling; Detectors; Fault detection; Laboratories; Power system modeling; Robustness; Synchronization; distributed system models; failure detection; non-synchronous systems; partitioned synchronous systems; synchronous/asynchronous systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Availability, Reliability and Security, 2009. ARES '09. International Conference on
  • Conference_Location
    Fukuoka
  • Print_ISBN
    978-1-4244-3572-2
  • Electronic_ISBN
    978-0-7695-3564-7
  • Type

    conf

  • DOI
    10.1109/ARES.2009.165
  • Filename
    5066483