Title :
2W-FD: A Failure Detector Algorithm with QoS
Author :
Tomsic, Alejandro Z. ; Sens, Pierre ; Garcia, Joao ; Arantes, Luciana ; Sopena, Julien
Author_Institution :
Inria, UPMC Univ. Paris 06, Paris, France
Abstract :
Failure detection plays a central role in the engineering of distributed systems. Furthermore, many applications have timing constraints and require failure detectors that provide quality of service (QoS) with some quantitative timeliness guarantees. Therefore, they need failure detectors that are fast and accurate. We introduce the Two-Windows Failure Detector (2W-FD), an algorithm able to react to sudden changes in network conditions, property that currently existing algorithms do not satisfy. We ran tests on real traces and compared the 2W-FD to state-of-art algorithms. Our results show that our algorithm presents the best performance in terms of speed and accuracy in unstable scenarios.
Keywords :
distributed processing; quality of service; software fault tolerance; 2W-FD algorithm; QoS; distributed systems; failure detection; quality of service; quantitative timeliness guarantees; two-windows failure detector algorithm; Computers; Delays; Detectors; Estimation; Heart beat; Quality of service; Distributed Algorithms; Failure Detectors; Fault Tolerance; Quality of Service; Quiescence; Reliability;
Conference_Titel :
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location :
Hyderabad
DOI :
10.1109/IPDPS.2015.74