• DocumentCode
    2570934
  • Title

    Testing of fault-tolerant and real-time distributed systems via protocol fault injection

  • Author

    Dawson, Scott ; Jahanian, Farnam ; Mitton, Todd ; Tung, Teck-Lee

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • fYear
    1996
  • fDate
    25-27 Jun 1996
  • Firstpage
    404
  • Lastpage
    414
  • Abstract
    As software for distributed systems becomes more complex, ensuring that a system meets its prescribed specification is a growing challenge that confronts software developers. This is particularly important for distributed applications with strict dependability and timeliness constraints. This paper reports on ORCHESTRA, a portable fault injection environment for testing implementations of distributed protocols. This tool is based on a simple yet powerful framework called script-driven probing and fault injection, for the evaluation and validation of the fault-tolerance and timing characteristics of distributed protocols. The tool, which was initially developed on the Real-Time Mach operating system and later ported to other platforms including Solaris and SunOS, has been used to conduct extensive experiments on several protocol implementations. This paper describes the design and implementation of the fault injection tool focusing on architectural features to support portability, minimizing intrusiveness on target protocols, and explicit support for testing real-time systems. The paper also describes the experimental evaluation of two protocol implementations: a real-time audio-conferencing application on Real-Time Mach, and a distributed group membership service on the Sun Solaris operating system
  • Keywords
    distributed processing; operating systems (computers); program testing; program verification; protocols; real-time systems; software fault tolerance; software portability; ORCHESTRA; Real-Time Mach operating system; Solaris; SunOS; audioconferencing application; dependability; distributed group membership service; distributed protocols; fault injection tool; fault-tolerant systems testing; protocol fault injection; real-time distributed systems; script-driven probing; specification; timeliness constraints; timing; Application software; Fault tolerance; Fault tolerant systems; Operating systems; Protocols; Real time systems; Software systems; Sun; System testing; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault Tolerant Computing, 1996., Proceedings of Annual Symposium on
  • Conference_Location
    Sendai
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-7262-5
  • Type

    conf

  • DOI
    10.1109/FTCS.1996.534626
  • Filename
    534626