Title :
CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster
Author :
Wei, Qingsong ; Veeravalli, Bharadwaj ; Gong, Bozhao ; Zeng, Lingfang ; Feng, Dan
Author_Institution :
Data Storage Inst., A*STAR, Singapore, Singapore
Abstract :
Data replication has been widely used as a mean of increasing the data availability of large-scale cloud storage systems where failures are normal. Aiming to provide cost-effective availability, and improve performance and load-balancing of cloud storage, this paper presents a cost-effective dynamic replication management scheme referred to as CDRM. A novel model is proposed to capture the relationship between availability and replica number. CDRM leverages this model to calculate and maintain minimal replica number for a given availability requirement. Replica placement is based on capacity and blocking probability of data nodes. By adjusting replica number and location according to workload changing and node capacity, CDRM can dynamically redistribute workloads among data nodes in the heterogeneous cloud. We implemented CDRM in Hadoop Distributed File System (HDFS) and experiment results conclusively demonstrate that our CDRM is cost effective and outperforms default replication management of HDFS in terms of performance and load balancing for large-scale cloud storage.
Keywords :
distributed processing; probability; resource allocation; storage management; Hadoop distributed file system; blocking probability; cloud storage cluster; cost-effective dynamic replication management scheme; data replication; heterogeneous cloud; load balancing; replica placement; Availability; Bandwidth; Clouds; Distributed databases; File systems; Parallel processing; Silicon; Cost-effective; Dynamic Replication Management; HDFS; Load Balance;
Conference_Titel :
Cluster Computing (CLUSTER), 2010 IEEE International Conference on
Conference_Location :
Heraklion, Crete
Print_ISBN :
978-1-4244-8373-0
Electronic_ISBN :
978-0-7695-4220-1
DOI :
10.1109/CLUSTER.2010.24