DocumentCode
153245
Title
A Fault-Tolerant Strategy of Redeploying the Lost Replicas in Cloud
Author
Ning Wang ; Yang Yang ; Zhenqiang Mi ; Qing Ji ; Kun Meng
Author_Institution
Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
fYear
2014
fDate
7-11 April 2014
Firstpage
370
Lastpage
375
Abstract
In cloud storage centers, file replicas may be lost when nodes fail, which degrades file access efficiency and user satisfaction. A common remedy is to redeploy the lost replicas on other servers to maintain system availability. A file is normally divided into many blocks of equal size, and block popularity varies across a cloud storage system, so popularity can serve as a parameter when deploying replicas. In this paper, the Scarlett system is therefore used to determine the optimal number of replicas for each block based on block popularity. Then, taking into account system load, total cost, and quality of service, we present a selective data recovery method for node failures, together with a cost-efficient replica deployment strategy named CERD. The strategy has been verified in HDFS. Finally, we simulate an environment with random cloud node failures and compare our strategy with the Hadoop default strategy. The results show that CERD balances the load of the whole system, reduces the total cost of service, and provides higher service quality, consistent with the theoretical analysis.
Keywords
cloud computing; fault tolerant computing; replicated databases; storage management; CERD; HDFS; Hadoop default; Scarlett system; block popularity; block replica; cloud storage centers; cloud storage system; cost-efficient replicas deployment strategy; data recovery method; fault-tolerant strategy; file access; file replica; lost replicas; quality of services; random cloud node failure; system availability; system load; user satisfaction; Availability; Fault tolerance; Optimization; Quality of service; Time factors; node failure; service cost
fLanguage
English
Publisher
IEEE
Conference_Titel
Service Oriented System Engineering (SOSE), 2014 IEEE 8th International Symposium on
Conference_Location
Oxford
Type
conf
DOI
10.1109/SOSE.2014.62
Filename
6830932