Title :
ADAPTATION - Algorithms to Adaptive Fault Monitoring and their implementation on CORBA
Author :
Sotoma, Irineu ; Madeira, Edmundo Roberto Mauro
Author_Institution :
Univ. Estadual de Campinas, Sao Paulo, Brazil
fDate :
6/23/1905 12:00:00 AM
Abstract :
This paper presents ADAPTATION - Algorithms to Adaptive Fault Monitoring for asynchronous distributed systems and their implementation on CORBA. Our algorithms vary the timeouts based on a recent history of last elapsed times of the monitoring messages. The aim of the proposed algorithms is to provide a better response time to crashes and a minimum discrepancy between a suspection due to the network overload and due to the real process crash. The proposed approach extends the Fault Tolerant CORBA OMG specification with the push model and the definition of pull and push ADAPTATION fault monitors. Some ADAPTATION experiments on ACE+TAO were made to observe their behavior on changing network workloads
Keywords :
distributed object management; software fault tolerance; system monitoring; system recovery; ACE+TAO; ADAPTATION; ADAPTATION fault monitors; CORBA; Fault Tolerant CORBA OMG specification; algorithms to adaptive fault monitoring; asynchronous distributed systems; last elapsed times; monitoring messages; network overload; network workloads; push model; real process crash; response time; timeouts; Algorithm design and analysis; Computer crashes; Condition monitoring; Delay effects; Detectors; Distributed computing; Fault detection; Fault tolerance; History; Object detection;
Conference_Titel :
Distributed Objects and Applications, 2001. DOA '01. Proceedings. 3rd International Symposium on
Conference_Location :
Rome
Print_ISBN :
0-7695-1300-X
DOI :
10.1109/DOA.2001.954087