Title :
A novel self-regulatory failure detection algorithm for distributed storage system
Author :
Peng Guo-jian ; Chen Guo-min ; Liu Ze-hua ; Luo Chen-hui
Author_Institution :
Coll. of Econ. & Bus. Adm., Univ. of South China, Hengyang, China
Abstract :
Distributed storage systems have the advantages of self-organizing, scalability, robust and fault-tolerant, which provide fundamental support for mission-critical applications. Unfortunately, there still exist some problems in real network conditions, especially, when considering asynchronous distributed environments, where messages may be delayed indefinitely and nodes may fail. So, failure detector is one of basic components to build a reliable distributed storage system. Considering the highly dynamic characteristics of distributed system, a novel self-regulatory failure detection algorithm (SFDA), which can combine heartbeat strategy with unbiased grey prediction model, is designed to improve the failure detection quality of system (QoS) according to the application needs and network environment changes. The key goal of using correction value in SFDA is to provide adoptable and usable service in all kinds of applications. The results show that, failure detector based on SFDA is more effective and flexible than other failure detectors.
Keywords :
distributed processing; grey systems; reliability; storage management; system recovery; SFDA; correction value; distributed storage system; failure QoS; failure detection quality of system; failure detector; heartbeat strategy; network environment changes; self-regulatory failure detection algorithm; unbiased grey prediction model; Computer crashes; Data models; Detectors; Heart beat; Monitoring; Predictive models; Quality of service; Failure detector; Global stabilization time; Self-regulatory; Storage system;
Conference_Titel :
Instrumentation and Measurement, Sensor Network and Automation (IMSNA), 2013 2nd International Symposium on
Conference_Location :
Toronto, ON
DOI :
10.1109/IMSNA.2013.6743366