• DocumentCode
    390034
  • Title
    Implementation of threshold-based diagnostic mechanisms for COTS-based applications
  • Author
    Romano, L. ; Bondavalli, A. ; Chiaradonna, S. ; Cotroneo, D.
  • Author_Institution
    Dipt. di Informatica e Sistemistica, Università degli Studi di Napoli, Italy
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    296
  • Lastpage
    303
  • Abstract
    This work investigates feasibility issues that must be addressed when threshold-based mechanisms are to be used for diagnostic purposes in COTS-based distributed systems. Threshold-based mechanisms have typically been used for such purposes in embedded systems. A variety of solutions exist, with different characteristics of completeness, accuracy, and induced overhead. We first discuss the challenges related to applying such mechanisms to COTS-based distributed applications. We then identify alternative strategies for diagnosis, which use run-time data on COTS component service failures to trigger alarms to reconfiguration and fault treatment mechanisms. We implement those strategies in a system prototype, which is based on a substantial application, i.e., a real-world (as opposed to a toy) application. We discuss the relationships between the sensitivity of the quality of service (QoS) provided by the diagnostic mechanisms and the accuracy of the available failure data. Our considerations and preliminary experiments on the prototype suggest that a careful evaluation of tradeoffs must be conducted in order to achieve the best compromise between accuracy and cost, which depends on application characteristics and service deployment requirements.
  • Keywords
    distributed object management; program diagnostics; quality of service; software fault tolerance; software packages; CORBA; COTS component service failures; COTS-based distributed systems; alarms; distributed architectures; embedded systems; experiments; fault treatment mechanisms; quality of service; reconfiguration; run-time data; service deployment requirements; threshold-based diagnostic mechanisms; Application software; Bonding; Computer architecture; Costs; Embedded system; Fault diagnosis; Information systems; Prototypes; Quality of service; Runtime;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Reliable Distributed Systems, 2002. Proceedings. 21st IEEE Symposium on
  • ISSN
    1060-9857
  • Print_ISBN
    0-7695-1659-9
  • Type
    conf
  • DOI
    10.1109/RELDIS.2002.1180200
  • Filename
    1180200