Title :
ALTER: adaptive failure detection services for grids
Author :
Shi, Xuanhua ; Jin, Hai ; Han, Zongfen ; Qiang, Weizhong ; Wu, Song ; Zou, Deqing
Author_Institution :
Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
This paper presents an adaptive failure detection service (ALTER), which incorporates the technique of unreliable failure detection service and the idea of R-GMA. ALTER is organized in a hierarchical structure. It can be adaptive to the system conditions and user requirements with changing the system parameters and system organizations. With experimental evaluation, ALTER shows good scalability and flexibility.
Keywords :
fault tolerant computing; grid computing; system recovery; ALTER; R-GMA; adaptive failure detection service; grid; hierarchical structure; unreliable failure detection service; Adaptive systems; Computer crashes; Condition monitoring; Detectors; Fault detection; Global communication; Local area networks; Object detection; Protocols; Scalability;
Conference_Titel :
Services Computing, 2005 IEEE International Conference on
Print_ISBN :
0-7695-2408-7
DOI :
10.1109/SCC.2005.23