DocumentCode :
1686273
Title :
A generic log-service supporting fast recovery in distributed fault-tolerant systems
Author :
Wirz, B. ; Nett, E.
Author_Institution :
Gesellschaft fuer Mathematik und Datenverabeitung, Augustin, Germany
fYear :
1993
fDate :
10/6/1993 12:00:00 AM
Firstpage :
121
Lastpage :
126
Abstract :
Logs are an important facility for fault-tolerant distributed systems since they allow to reliably store information that is needed to provide a global consistent system state also in the presence of failures. The authors focus on the problem of fast recovery after a node crash. The approach is mainly based on minimizing the number of log records to be retrieved. This is achieved by periodically discarding obsolete information in a very efficient manner without effecting the normal logging procedure. The main idea behind is that the generic Log-Service provides a high level interface to the application which allows the Log-Service itself to interpret the semantics of log records without consulting the application during run-time. In addition, the authors are able to reduce the overhead in analyzing the log contents during restart by scanning the log only once and only forward
Keywords :
distributed processing; fault tolerant computing; finite state machines; system recovery; distributed fault-tolerant systems; distributed systems; fast recovery; log records; log-service; Automata; Computer crashes; Error correction; Fault tolerant systems; History; Protocols; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Parallel and Distributed Systems, 1993., Proceedings of the IEEE Workshop on
Conference_Location :
Princeton, NJ
Print_ISBN :
0-8186-5250-0
Type :
conf
DOI :
10.1109/APADS.1993.588858
Filename :
588858
Link To Document :
بازگشت