DocumentCode
2570934
Title
Testing of fault-tolerant and real-time distributed systems via protocol fault injection
Author
Dawson, Scott ; Jahanian, Farnam ; Mitton, Todd ; Tung, Teck-Lee
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
fYear
1996
fDate
25-27 Jun 1996
Firstpage
404
Lastpage
414
Abstract
As software for distributed systems becomes more complex, ensuring that a system meets its prescribed specification is a growing challenge for software developers. This is particularly important for distributed applications with strict dependability and timeliness constraints. This paper reports on ORCHESTRA, a portable fault injection environment for testing implementations of distributed protocols. The tool is based on a simple yet powerful framework, called script-driven probing and fault injection, for evaluating and validating the fault-tolerance and timing characteristics of distributed protocols. The tool, which was initially developed on the Real-Time Mach operating system and later ported to other platforms including Solaris and SunOS, has been used to conduct extensive experiments on several protocol implementations. This paper describes the design and implementation of the fault injection tool, focusing on architectural features that support portability, minimize intrusiveness on target protocols, and provide explicit support for testing real-time systems. The paper also describes the experimental evaluation of two protocol implementations: a real-time audio-conferencing application on Real-Time Mach, and a distributed group membership service on the Sun Solaris operating system.
Keywords
distributed processing; operating systems (computers); program testing; program verification; protocols; real-time systems; software fault tolerance; software portability; ORCHESTRA; Real-Time Mach operating system; Solaris; SunOS; audioconferencing application; dependability; distributed group membership service; distributed protocols; fault injection tool; fault-tolerant systems testing; protocol fault injection; real-time distributed systems; script-driven probing; specification; timeliness constraints; timing; Application software; Fault tolerance; Fault tolerant systems; Operating systems; Protocols; Real time systems; Software systems; Sun; System testing; Timing
fLanguage
English
Publisher
IEEE
Conference_Titel
Fault Tolerant Computing, 1996., Proceedings of Annual Symposium on
Conference_Location
Sendai
ISSN
0731-3071
Print_ISBN
0-8186-7262-5
Type
conf
DOI
10.1109/FTCS.1996.534626
Filename
534626