Title :
An integrated approach to fault tolerance
Author :
Elnozahy, Elmootazbellah N. ; Zwaenepoel, Willy
Author_Institution :
Dept. of Comput. Sci., Rice Univ., Houston, TX, USA
Abstract :
Describes Manetho, an experimental protocol system, whose goal is to explore the extent to which transparent fault tolerance can be added to long-running distributed applications. Transparent techniques are attractive because they can automatically add fault tolerance to existing applications that were written without consideration for reliability. Previous techniques for providing transparent fault-tolerance relied on rollback-recovery. However, rollback recovery is not appropriate for server processes where the lack of service during rollback is intolerable. Furthermore, rollback-recovery assumes that a process can be restarted on any available host. As a result, extended downtime cannot be tolerated for example in file servers, which have to run on the host where the disks reside. Manetho solves these problems with an integrated approach by using process replication for server processes and rollback-recovery for client processes
Keywords :
distributed databases; fault tolerant computing; protocols; Manetho; client processes; integrated approach; process replication; protocols; rollback-recovery; transparent fault tolerance; Application software; Availability; Computer science; Delay; Fault tolerance; Fault tolerant systems; File servers; Protocols;
Conference_Titel :
Management of Replicated Data, 1992., Second Workshop on the
Conference_Location :
Monterey, CA
Print_ISBN :
0-8186-3170-8
DOI :
10.1109/MRD.1992.242611