Title :
QoS of timeout-based self-tuned failure detectors: the effects of the communication delay predictor and the safety margin
Author :
Nunes, Raul Ceretta ; Jansch-Pôrto, Ingrid
Author_Institution :
Dept. of Electron. & Comput., Fed. Univ. of Santa Maria, Brazil
fDate :
28 June-1 July 2004
Abstract :
Unreliable failure detectors have been an important abstraction to build dependable distributed applications over asynchronous distributed systems subject to faults. Their implementations are commonly based on timeouts to ensure algorithm termination. However, for systems built on the Internet, it is hard to estimate this time value due to traffic variations. Thus, different types of predictors have been used to model this behavior and make predictions of delays. In order to increase the quality of service (QoS), self-tuned failure detectors dynamically adapt their timeouts to the communication delay behavior added of a safety margin. In this paper, we evaluate the QoS of a failure detector for different combinations of communication delay predictors and safety margins. As the results show, to improve the QoS, one must consider the relation between the pair predictor/margin, instead of each one separately. Furthermore, performance and accuracy requirements should be considered for a suitable relationship.
Keywords :
Internet; delays; quality of service; software fault tolerance; system recovery; telecommunication traffic; Internet; QoS; asynchronous distributed systems; communication delay predictor; communication traffic; quality-of-service; self-tuned failure detection; timeout-based failure detection; Application software; Computer crashes; Computer science; Delay effects; Detectors; Distributed computing; Fault detection; Protocols; Quality of service; Safety;
Conference_Titel :
Dependable Systems and Networks, 2004 International Conference on
Print_ISBN :
0-7695-2052-9
DOI :
10.1109/DSN.2004.1311946