• DocumentCode
    1976844
  • Title

    Primary-backup replication: from a time-free protocol to a time-based implementation

  • Author

    Oliveira, Rui ; Pereira, José ; Schiper, André

  • Author_Institution
    Univ. do Minho, Portugal
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    14
  • Lastpage
    23
  • Abstract
    Fault-tolerant control systems can be built by replicating critical components. However replication raises the issue of inconsistency. Multiple protocols for ensuring consistency have been described in the literature. PADRE (Protocol for Asymmetric Duplex REdundancy) is such a protocol, and an interesting case study of a complex and sensitive problem: the management of replicated traffic controllers in a railway system. However, the low level at which the protocol has been developed embodies system details, namely timeliness assumptions, that make it difficult to understand and may narrow its applicability. We argue that, when designing a protocol, it is preferable to consider first a general solution that does not include any timeliness assumptions; then, by taking into account an additional hypothesis, one can easily design a time-based solution tailored to a specific environment. This paper illustrates the benefit of a top-down protocol design approach and shows that PADRE can be seen as an instance of a standard primary-backup replication protocol based on view-synchronous communication (VSC)
  • Keywords
    back-up procedures; computerised control; fault tolerant computing; memory protocols; rail traffic; redundancy; replicated databases; traffic control; PADRE; applicability; asymmetric duplex redundancy; fault-tolerant control systems; inconsistency; primary-backup replication protocol; replicated critical components; replicated traffic controller management; system details; time-based implementation; time-free protocol; timeliness assumptions; top-down protocol design approach; view-synchronous communication; Communication standards; Communication system control; Control systems; Delay effects; Fault tolerance; Fault tolerant systems; Protocols; Rail transportation; Redundancy; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 2001. Proceedings. 20th IEEE Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1060-9857
  • Print_ISBN
    0-7695-1366-2
  • Type

    conf

  • DOI
    10.1109/RELDIS.2001.969730
  • Filename
    969730