DocumentCode :
1102868
Title :
Election in asynchronous complete networks with intermittent link failures
Author :
Abu-Amara, Hosame ; Lokre, Jahnavi
Author_Institution :
Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
Volume :
43
Issue :
7
fYear :
1994
fDate :
7/1/1994 12:00:00 AM
Firstpage :
778
Lastpage :
788
Abstract :
Considers the problem of fault-tolerant leader election in asynchronous complete (fully-connected) distributed networks. The processors are reliable, but some of the communication channels may fail intermittently before or during the execution of the algorithm. Channel failures are undetectable due to the asynchronous nature of the network. Let n be the number of processors in the network and f be the maximum number of faulty channels incident on each processor, where f⩽½[n-1]. Our algorithm uses at most O(n2+nf2 ) messages to elect a unique leader of the network. Each message consists of at most O(log|T|) bits, where |T| is the cardinality of the set of processor identifiers. All previous algorithms either tolerated only benign failures such as fail-stop failures, assumed that the network is synchronous, tolerated only a small number of failures, or assumed that the faults are detectable. Our algorithm is the first election algorithm that is designed specifically for asynchronous intermittently faulty complete networks in which up to ¼n[n-1] channels may be faulty, where each processor is adjacent to no more than ½[n-1] faulty channels, and where the faults are undetectable
Keywords :
communication complexity; failure analysis; fault tolerant computing; reliability; telecommunication channels; telecommunication network management; algorithm execution; asynchronous complete networks; communication channel failures; distributed algorithms; fault-tolerant leader election; faulty channels; fully-connected distributed networks; intermittent link failures; message complexity; processor identifier set cardinality; reliable processors; undetectable faults; Algorithm design and analysis; Communication channels; Distributed algorithms; Fault detection; Fault tolerance; File systems; Intelligent networks; Joining processes; Nominations and elections; Testing;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.293257
Filename :
293257
Link To Document :
بازگشت