• DocumentCode
    2483148
  • Title

    An integration of the primary-shadow TMO replication scheme with a supervisor-based network surveillance scheme and its recovery time bound analysis

  • Author

    Kim, K.H. ; Subbaraman, Chittur

  • Author_Institution
    Dept. of Electr. & Comput. Eng., California Univ., Irvine, CA, USA
  • fYear
    1998
  • fDate
    20-23 Oct 1998
  • Firstpage
    168
  • Lastpage
    176
  • Abstract
    The time-triggered message-triggered object (TMO) scheme was formulated a few years ago (K.H. Kim et al., 1994; K.H. Kim and C. Subbaraman, 1997), as a major extension of the conventional object structuring schemes with the idealistic goal of facilitating general form design and timeliness-guaranteed design of complex real time application systems. Recently, as a new scheme for realizing TMO-structured distributed and parallel computer systems capable of both hardware and software fault tolerance, we have formulated and demonstrated the primary-shadow TMO replication (PSTR) scheme. An important new extension of the PSTR scheme is an integration of the PSTR scheme and a network surveillance (NS) scheme. This extension results in a significant improvement in the fault coverage and recovery time bound achieved. The NS scheme adopted is a recently developed scheme, effective in a wide range of point-to-point networks and it is called the supervisor based NS (SNS) scheme. The integration of the PSTR scheme and the SNS scheme is called the PSTR/SNS scheme. The recovery time bound of the PSTR/SNS scheme is analyzed on the basis of an implementation model that can be easily adapted to various commercial operating system kernels
  • Keywords
    distributed processing; fault tolerant computing; message passing; object-oriented programming; operating system kernels; real-time systems; system recovery; NS scheme; PSTR scheme; PSTR/SNS scheme; SNS scheme; commercial operating system kernels; complex real time application systems; fault coverage; general form design; implementation model; object structuring schemes; parallel computer systems; point-to-point networks; primary-shadow TMO replication scheme; recovery time bound analysis; software fault tolerance; supervisor based NS; supervisor based network surveillance scheme; time-triggered message-triggered object scheme; timeliness-guaranteed design; Application software; Concurrent computing; Design engineering; Distributed computing; Electrical capacitance tomography; Fault tolerant systems; Hardware; Real time systems; Software performance; Surveillance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 1998. Proceedings. Seventeenth IEEE Symposium on
  • Conference_Location
    West Lafayette, IN
  • ISSN
    1060-9857
  • Print_ISBN
    0-8186-9218-9
  • Type

    conf

  • DOI
    10.1109/RELDIS.1998.740490
  • Filename
    740490