• DocumentCode
    2282015
  • Title

    An object-oriented approach for implementing algorithm-based fault tolerance

  • Author

    Acree, Richard K. ; Ullah, Nasr ; Karia, Adarsh ; Rahmeh, Joe T. ; Abraham, Jacob A.

  • Author_Institution
    Comput. Eng. Res. Center, Texas Univ., Austin, TX, USA
  • fYear
    1993
  • fDate
    23-26 Mar 1993
  • Firstpage
    210
  • Lastpage
    216
  • Abstract
    The authors demonstrate the practical use of an object-oriented system to incorporate fault tolerance and reliability into data objects. The object-based fault tolerance scheme uses abstraction to conceal algorithm-based fault tolerance layers. The scheme allows a layer of fault tolerance to be added to data objects without affecting or altering the use of the data objects. It is shown that the C++ class mechanisms of overloading and derivation permit the added fault tolerance to be transparent to the original data objects. To demonstrate the feasibility of this approach, using C++, a libray of matrix functions is presented and a layer of fault tolerance around matrix data objects is added. The weighted checksum code technique was implemented to create fault-tolerant matrix data objects. This allows programmers to add algorithm-based fault tolerance onto existing matrix applications without requiring modification to the original application. The implementation was experimentally evaluated using a software fault-injection tool, that emulated realistic hardware faults. An error coverage of over 96% was obtained with a memory overhead of 28%. The empirical results confirm the viability of the approach by demonstrating that object-based encapsulation is a valid method for transparently implementing algorithm-based fault tolerance
  • Keywords
    C language; fault tolerant computing; object-oriented programming; software fault tolerance; C++ class mechanisms; abstraction; algorithm-based fault tolerance; matrix functions; object-based encapsulation; object-oriented approach; reliability; software fault-injection tool; weighted checksum code technique; Application software; Costs; Data engineering; Fault tolerance; Fault tolerant systems; Hardware; Jacobian matrices; Operating systems; RNA; Reliability engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers and Communications, 1993., Twelfth Annual International Phoenix Conference on
  • Conference_Location
    Tempe, AZ
  • Print_ISBN
    0-7803-0922-7
  • Type

    conf

  • DOI
    10.1109/PCCC.1993.344462
  • Filename
    344462