Title :
Analysis of a Class of Recovery Procedures
Author :
Koren, Israel ; Koren, Zahava ; Su, Stephen Y H
Author_Institution :
Department of Electrical Engineering, Technion—Israel Institute of Technology
Abstract :
Recovery procedures involving time redundancy in the form of instruction retries and program rollbacks have proved to be very effective against transient failures in computer systems. A class of such recovery procedures is presented and analyzed here, and the parameters of each procedure are determined so that the system´s operation is optimized. These procedures are then compared in order to select the most appropriate one for given system parameters.
Keywords :
Checkpoint; error latency; error recovery procedures; instruction retry; intermittent faults; permanent faults; program rollback; Availability; Computer aided instruction; Computer errors; Computer science; Costs; Delay effects; Fault detection; Fault diagnosis; Hardware; Redundancy; Checkpoint; error latency; error recovery procedures; instruction retry; intermittent faults; permanent faults; program rollback;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.1986.1676821