Title :
Error recovery in asynchronous systems
Author :
Campbell, Roy H. ; Randell, Brian
Author_Institution :
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
Abstract :
A framework for the provision of fault tolerance in asynchronous systems is introduced. The proposal generalizes the form of simple recovery facilities supported by nested atomic actions in which the exception mechanisms only permit backward error recovery. It allows the construction of systems using both forward and backward error recovery and thus allows the exploitation of the complementary benefits of the two schemes. Backward recovery, forward recovery, and normal processing activities can occur concurrently within the organization proposed. Exception handling is generalized to provide a uniform basis for fault tolerance schemes with the atomic action structure. The generalization includes a resolution scheme for concurrently raised exceptions based on an exception tree and an abortion scheme that permits the termination of the internal atomic actions. An automatic resolution mechanism is outlined for exceptions in atomic actions which allows users to separate their recovery schemes from the details of the underlying algorithms.
Keywords :
fault tolerant computing; software reliability; system recovery; asynchronous systems; automatic resolution mechanism; error recovery; exception handling; fault tolerance; nested atomic actions; software reliability; Atomic measurements; Computers; Context; Fault tolerance; Fault tolerant systems; Protocols; Software; Asynchronous systems; atomic actions; error recovery; exception mechanism; programming techniques; software fault tolerance; software reliability;
Journal_Title :
Software Engineering, IEEE Transactions on
DOI :
10.1109/TSE.1986.6312984