• DocumentCode
    3340525
  • Title

    Fault-tolerance for off-the-shelf applications and hardware

  • Author

    Russinovich, M. ; Segall, Z.

  • Author_Institution
    Dept. of Comput. Sci., Oregon Univ., Eugene, OR, USA
  • fYear
    1995
  • fDate
    27-30 June 1995
  • Firstpage
    67
  • Lastpage
    71
  • Abstract
    The concept of middleware provides a transparent way to augment and change the characteristics of a service provider as seen from a client. Fault tolerant policies are ideal candidates for middleware implementation. We have defined and implemented operating system based middleware support that provides the power and flexibility needed by diverse fault tolerant policies. This mechanism, called the sentry, has been built into the UNIX 4.3 BSD operating system server running on a Mach 3.0 kernel. To demonstrate the effectiveness of the mechanism several policies have been implemented using sentries including checkpointing and journaling. The implementation shows that complex fault tolerant policies can be efficiently and transparently implemented as middleware. Performance overhead of input journaling is less than 5% and application suspension during the checkpoint is typically under 10 seconds in length. A standard hard disk is used to store journal and checkpoint information with dedicated storage requirements of less than 20 MB.<>
  • Keywords
    Unix; fault tolerant computing; operating systems (computers); reliability; software fault tolerance; Mach 3.0 kernel; UNIX 4.3 BSD operating system server; application suspension; checkpointing; client; dedicated storage requirements; fault tolerant policies; fault-tolerance; hard disk; input journaling; journaling; middleware; off-the-shelf applications; off-the-shelf hardware; operating system based middleware support; performance overhead; sentry; service provider; Application software; Checkpointing; Computer science; Fault detection; Fault tolerance; Fault tolerant systems; Hardware; Middleware; Operating systems; Software standards;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1995. FTCS-25. Digest of Papers., Twenty-Fifth International Symposium on
  • Conference_Location
    Pasadena, CA, USA
  • Print_ISBN
    0-8186-7079-7
  • Type

    conf

  • DOI
    10.1109/FTCS.1995.466997
  • Filename
    466997