Title :
RFH: A Resilient, Fault-Tolerant and High-Efficient Replication Algorithm for Distributed Cloud Storage
Author :
Qu, Yanzhen ; Xiong, Naixue
Author_Institution :
Sch. of Comput. Sci., Colorado Tech. Univ., CO, USA
Abstract :
To avoid failure and achieve higher availability, replication scheme is now widely used in distributed Cloud storage systems [25]. However, most of them only statically replicate data on some randomly chosen nodes for a fixed number of times and it is obviously not enough for more reasonable resource allocation. Moreover, query load for Web application is highly irregular. It throws us into a dilemma to always maintain maximum number of replicas in case of explosive query load outburst or save resources with fewer replicas at the expense of performance. In this paper, we present a Resilient, Fault-tolerant and High-efficient global replication algorithm (RFH) for distributed Cloud storage systems. RFHis especially efficient facing ´flash crowd´ problem. Each data partition is represented by a virtual node. Each virtual node itself decides whether to replicate, migrate or suicide by weighing up the pros and cons. It is based on the evaluation of traffic load of all nodes, and selects among physical nodes with the most traffic (traffic hub) to replicate or migrate on. After that, it takes into account blocking probability to achieve quicker response and better load balance performance. Extensive simulations have been conducted and the results have demonstrated that the proposed scheme RFH outperforms the main existing algorithms the request-oriented algorithms[16] [5], the owner-oriented algorithms [7] [11] [12] [13] and the random algorithms [4] [21] [22] in terms of high replica utilization rate, high query efficiency and reasonable path length at a low cost while maintaining high availability.
Keywords :
cloud computing; probability; storage management; RFH; Web application; blocking probability; distributed cloud storage systems; fault-tolerant replication algorithm; flash crowd problem; high-efficient replication algorithm; owner-oriented algorithms; query load outburst; request-oriented algorithms; resilient replication algorithm; resource allocation; Algorithm design and analysis; Availability; Bismuth; Cloud computing; Partitioning algorithms; Routing; Servers; Data replication; Distributed Cloud storage; Fault-tolerance; High-efficient;
Conference_Titel :
Parallel Processing (ICPP), 2012 41st International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4673-2508-0
DOI :
10.1109/ICPP.2012.3