Title :
Progress in real-time fault tolerance
Author :
Melliar-Smith, P.M. ; Moser, L.E.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Abstract :
This paper discusses progress in the field of real-time fault tolerance. In particular, it considers synchronous vs. asynchronous fault tolerance designs, maintaining replica consistency, alternative fault tolerance strategies, including checkpoint restoration, transactions, and consistent replay, and custom vs. generic fault tolerance.
Keywords :
checkpointing; distributed processing; fault tolerant computing; asynchronous fault tolerance; checkpoint restoration; real-time fault tolerance; replica consistency; synchronous fault tolerance; Application software; Costs; Delay; Fault detection; Fault tolerance; Hardware; Microprocessors; Probability distribution; Real time systems; Timing;
Conference_Titel :
Reliable Distributed Systems, 2004. Proceedings of the 23rd IEEE International Symposium on
Print_ISBN :
0-7695-2239-4
DOI :
10.1109/RELDIS.2004.1353010