• DocumentCode
    3349521
  • Title

    Experimental evaluation of the fail-silent behavior of a distributed real-time run-time support built from COTS components

  • Author

    Chevochot, Pascal ; Puaut, Isabelle

  • Author_Institution
    IRISA, Rennes, France
  • fYear
    2001
  • fDate
    1-4 July 2001
  • Firstpage
    304
  • Lastpage
    313
  • Abstract
    Mainly for economic and maintainability reasons, more and more dependable real-time systems are being built from commercial off-the-shelf (COTS) components. To build these systems, a commonly-used assumption is that computers are fail-silent. The goal of our work is so determine the coverage of the fail-silence assumption for computers executing a real-time run-time support system built exclusively from COTS components, in the presence of physical faults. The evaluation of fail-silence has been performed on the HADES (Highly Available Distributed Embedded System) run-time support system, aimed at executing distributed hard real-time dependable applications. The main result of the evaluation is a fail-silence coverage of 99.1%. Moreover, we evaluate the error detection mechanisms embedded in HADES according to a rich set of metrics which provides guidance for choosing the set of error detection mechanisms that is best suited to the system needs (e.g. find the best trade-off between fail-silence coverage and overhead caused by error detection).
  • Keywords
    distributed processing; error detection; program interpreters; real-time systems; software fault tolerance; software packages; software performance evaluation; subroutines; COTS components; HADES; Highly Available Distributed Embedded System; commercial off-the-shelf components; dependable real-time systems; distributed real-time run-time support system; economics; error detection mechanisms; fail-silence coverage; fail-silent behavior; maintainability; overhead; physical faults; system needs; Application software; Computer errors; Costs; Distributed computing; Fault detection; Fault tolerance; Hardware; Multicast protocols; Real time systems; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems and Networks, 2001. DSN 2001. International Conference on
  • Conference_Location
    Goteborg, Sweden
  • Print_ISBN
    0-7695-1101-5
  • Type

    conf

  • DOI
    10.1109/DSN.2001.941415
  • Filename
    941415