Author :
Ozaki, Brenda M. ; Fernandez, Eduardo B. ; Gudes, Ehud
Abstract :
The effect on software fault tolerance of hardware features such as hierarchical privilege levels (rings), the use of descriptors for memory protection, separated virtual address spaces, and ring crossings that enforce specific energy points is considered. A strategy that uses a separate programming layer, the recovery layer, to handle fault-tolerant aspects of process interactions is discussed. The recovery metaprogram (RPM) which monitors the run-time behavior of the application program and coordinates error detection, recovery, and reconfiguration, is examined, focusing on privilege levels, which provide protection against error propagation, RMP implementation, and conversations. The intel 80286 has been used as a sample implementation vehicle, but most of the discussion applies to any machine with a similar range of features. Extension to multiprocessor systems is indicated.<>
Keywords :
fault tolerant computing; software reliability; RPM; application program; architectures; descriptors; energy points; error detection; error propagation; hierarchical privilege levels; hierarchical protection levels; intel 80286; memory protection; multiprocessor systems; privilege levels; process interactions; programming layer; reconfiguration; recovery; recovery layer; recovery metaprogram; ring crossings; rings; run-time behavior; separated virtual address spaces; software fault tolerance; Computer architecture; Control systems; Fault tolerance; Hardware; Microprocessors; Programming profession; Protection; Redundancy; Software testing; Voting;