DocumentCode
2377786
Title
Rejuvenation and failure detection in partitionable systems
Author
Fetzer, Christof ; Högstedt, Karin
fYear
2001
fDate
2001
Firstpage
154
Lastpage
161
Abstract
Certain gateways (e.g., some cable or DSL modems) are known to have low reliability and low availability. Most failures of these devices can however be "fixed" by rejuvenating the device after a failure has been detected. Such a detection based rejuvenation strategy permits increasing the availability of these gateways. In the considered scenario, rejuvenation is non-trivial since a failure of such a gateway will leave it partitioned away from the network. In particular, network operators that want to rejuvenate these gateways are in a different network partition, and can therefore not initiate a remote rejuvenation. In this paper we propose a failure detection based rejuvenation service and a remote detection service. The rejuvenation service detects and fixes "soft" failures automatically (in one partition), and the detection service detects (in another partition) all rejuvenations exactly once, within a bounded amount of time, even when the gateway is rejuvenated consecutively. The detection service also allows the detection of "hard" failures, and filtering of notifications of soft failures
Keywords
computer network management; fault tolerant computing; internetworking; local area networks; availability; distributed systems; failure detection; fault-tolerant systems; gateways; home networking; local area network; local area system; network manager; rejuvenation service; reliability; remote detection; remote system management; Availability; Costs; DSL; Fault detection; IP networks; Internet; Local area networks; Modems; Protocols; Software maintenance;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Computing, 2001. Proceedings. 2001 Pacific Rim International Symposium on
Conference_Location
Seoul
Print_ISBN
0-7695-1414-6
Type
conf
DOI
10.1109/PRDC.2001.992692
Filename
992692
Link To Document