DocumentCode
3348915
Title
An (N -1)-resilient algorithm for distributed termination detection
Author
Lai, Ten-Hwang ; Wu, Li-Fen
Author_Institution
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
fYear
1992
fDate
1-4 Dec 1992
Firstpage
274
Lastpage
281
Abstract
The authors propose a fault-tolerant algorithm for termination detection that can tolerate any number of stop failures. This algorithm improves on existing ones in many aspects, including worst-case message complexity, average-case message complexity and storage overhead. Most important, it runs as efficiently as the best non-fault-tolerant algorithm available if no process fails during the computation, and incurs only a reasonable amount of cost for each process failure that actually occurs. The main idea of the algorithm is a barrier set at a particular process. It is possible that other termination detection algorithms can be made fault-tolerant using similar techniques
Keywords
communication complexity; fault tolerant computing; program verification; average-case message complexity; distributed termination detection; fault-tolerant algorithm; storage overhead; worst-case message complexity; Communication networks; Costs; Detectors; Distributed algorithms; Distributed computing; Face detection; Fault detection; Fault tolerance; Fault tolerant systems; Information science;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing, 1992. Proceedings of the Fourth IEEE Symposium on
Conference_Location
Arlington, TX
Print_ISBN
0-8186-3200-3
Type
conf
DOI
10.1109/SPDP.1992.242734
Filename
242734
Link To Document