• DocumentCode
    1304304
  • Title

    On the Quality of Service of Crash-Recovery Failure Detectors

  • Author

    Ma, Tiejun ; Hillston, Jane ; Anderson, Stuart

  • Author_Institution
    Dept. of Comput., Imperial Coll. London, London, UK
  • Volume
    7
  • Issue
    3
  • fYear
    2010
  • Firstpage
    271
  • Lastpage
    283
  • Abstract
    We model the probabilistic behavior of a system comprising a failure detector and a monitored crash-recovery target. We extend failure detectors to take account of failure recovery in the target system. This involves extending QoS measures to include the recovery detection speed and proportion of failures detected. We also extend estimating the parameters of the failure detector to achieve a required QoS to configuring the crash-recovery failure detector. We investigate the impact of the dependability of the monitored process on the QoS of our failure detector. Our analysis indicates that variation in the MTTF and MTTR of the monitored process can have a significant impact on the QoS of our failure detector. Our analysis is supported by simulations that validate our theoretical results.
  • Keywords
    computerised monitoring; failure analysis; fault diagnosis; quality of service; software fault tolerance; system recovery; MTTF process; MTTR process; crash recovery failure detector; failure recovery; monitored crash recovery target; probabilistic system behavior; quality of service; recovery detection speed; Analytical models; Availability; Computer crashes; Condition monitoring; Detectors; Failure analysis; Information retrieval; Parameter estimation; Quality of service; Velocity measurement; Failure detectors; availability; crash recovery; dependability; performance.; quality of service;
  • fLanguage
    English
  • Journal_Title
    Dependable and Secure Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5971
  • Type

    jour

  • DOI
    10.1109/TDSC.2009.35
  • Filename
    5210115