Title :
An integration of the primary-shadow TMO replication scheme with a supervisor-based network surveillance scheme and its recovery time bound analysis
Author :
Kim, K.H. ; Subbaraman, Chittur
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
Abstract :
The time-triggered message-triggered object (TMO) scheme was formulated a few years ago (K.H. Kim et al., 1994; K.H. Kim and C. Subbaraman, 1997), as a major extension of the conventional object structuring schemes with the idealistic goal of facilitating general form design and timeliness-guaranteed design of complex real time application systems. Recently, as a new scheme for realizing TMO-structured distributed and parallel computer systems capable of both hardware and software fault tolerance, we have formulated and demonstrated the primary-shadow TMO replication (PSTR) scheme. An important new extension of the PSTR scheme is an integration of the PSTR scheme and a network surveillance (NS) scheme. This extension results in a significant improvement in the fault coverage and recovery time bound achieved. The NS scheme adopted is a recently developed scheme, effective in a wide range of point-to-point networks and it is called the supervisor based NS (SNS) scheme. The integration of the PSTR scheme and the SNS scheme is called the PSTR/SNS scheme. The recovery time bound of the PSTR/SNS scheme is analyzed on the basis of an implementation model that can be easily adapted to various commercial operating system kernels
Keywords :
distributed processing; fault tolerant computing; message passing; object-oriented programming; operating system kernels; real-time systems; system recovery; NS scheme; PSTR scheme; PSTR/SNS scheme; SNS scheme; commercial operating system kernels; complex real time application systems; fault coverage; general form design; implementation model; object structuring schemes; parallel computer systems; point-to-point networks; primary-shadow TMO replication scheme; recovery time bound analysis; software fault tolerance; supervisor based NS; supervisor based network surveillance scheme; time-triggered message-triggered object scheme; timeliness-guaranteed design; Application software; Concurrent computing; Design engineering; Distributed computing; Electrical capacitance tomography; Fault tolerant systems; Hardware; Real time systems; Software performance; Surveillance;
Conference_Titel :
Reliable Distributed Systems, 1998. Proceedings. Seventeenth IEEE Symposium on
Conference_Location :
West Lafayette, IN
Print_ISBN :
0-8186-9218-9
DOI :
10.1109/RELDIS.1998.740490