• DocumentCode
    2253858
  • Title

    On the quality of service of failure detectors

  • Author

    Chen, Wei ; Toueg, Sam ; Aguilera, Marcos Kawazoe

  • Author_Institution
    Oracle Corp., Nashua, NH, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    191
  • Lastpage
    200
  • Abstract
    Studies the quality of service (QoS) of failure detectors. By QoS, we mean a specification that quantifies (a) how fast the failure detector detects actual failures, and (b) how well it avoids false detections. We first propose a set of QoS metrics to specify failure detectors for systems with probabilistic behaviors, i.e. for systems where message delays and message losses follow some probability distributions. We then give a new failure detector algorithm and analyze its QoS in terms of the proposed metrics. We show that, among a large class of failure detectors, the new algorithm is optimal with respect to some of these QoS metrics. Given a set of failure detector QoS requirements, we show how to compute the parameters of our algorithm so that it satisfies these requirements, and we show how this can be done even if the probabilistic behavior of the system is not known. Finally, we briefly explain how to make our failure detector adaptive, so that it automatically reconfigures itself when there is a change in the probabilistic behavior of the network
  • Keywords
    adaptive systems; delays; failure analysis; fault tolerance; formal specification; losses; probability; quality of service; QoS metrics; adaptive detector; automatic reconfiguration; failure detection speed; failure detectors; false detections; message delays; message losses; optimal algorithm; parameter computation; probabilistic behaviour; probability distributions; service quality; specification; Algorithm design and analysis; Computer crashes; Computer science; Delay; Detectors; Failure analysis; Operating systems; Protocols; Quality of service; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems and Networks, 2000. DSN 2000. Proceedings International Conference on
  • Conference_Location
    New York, NY
  • Print_ISBN
    0-7695-0707-7
  • Type

    conf

  • DOI
    10.1109/ICDSN.2000.857535
  • Filename
    857535