• DocumentCode
    3348915
  • Title

    An (N-1)-resilient algorithm for distributed termination detection

  • Author

    Lai, Ten-Hwang ; Wu, Li-Fen

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
  • fYear
    1992
  • fDate
    1-4 Dec 1992
  • Firstpage
    274
  • Lastpage
    281
  • Abstract
    The authors propose a fault-tolerant algorithm for termination detection that can tolerate any number of stop failures. This algorithm improves on existing ones in many aspects, including worst-case message complexity, average-case message complexity and storage overhead. Most important, it runs as efficiently as the best non-fault-tolerant algorithm available if no process fails during the computation, and incurs only a reasonable amount of cost for each process failure that actually occurs. The main idea of the algorithm is a barrier set at a particular process. It is possible that other termination detection algorithms can be made fault-tolerant using similar techniques
  • Keywords
    communication complexity; fault tolerant computing; program verification; average-case message complexity; distributed termination detection; fault-tolerant algorithm; storage overhead; worst-case message complexity; Communication networks; Costs; Detectors; Distributed algorithms; Distributed computing; Face detection; Fault detection; Fault tolerance; Fault tolerant systems; Information science;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1992. Proceedings of the Fourth IEEE Symposium on
  • Conference_Location
    Arlington, TX
  • Print_ISBN
    0-8186-3200-3
  • Type

    conf

  • DOI
    10.1109/SPDP.1992.242734
  • Filename
    242734