Title :
Using Elasticity to Improve Inline Data Deduplication Storage Systems
Author :
Yufeng Wang ; Tan, C.C. ; Ningfang Mi
Author_Institution :
Temple Univ., Philadelphia, PA, USA
fDate :
June 27 2014-July 2 2014
Abstract :
Elasticity is the ability to scale computing resources such as memory on-demand, and is one of the main advantages of utilizing cloud computing services. With the increasing popularity of cloud based storage, it is natural that more deduplication based storage systems will be migrated to the cloud. Existing deduplication systems however, do not adequately take advantage of elasticity. In this paper, we illustrate how to use elasticity to improve deduplication based systems, and propose EAD (elasticity aware deduplication), an indexing algorithm that uses the ability to dynamically increase memory resources to improve overall deduplication performance. Our experimental results indicate that EAD is able to detect more than 98% of all duplicate data, however only consumes less than 5% of expected memory space. Meanwhile, it claims four times of deduplication efficiency than the state-of-art sampling technique while costs less than half of the amount of memory.
Keywords :
cloud computing; data handling; database indexing; storage management; EAD; cloud based storage; cloud computing services; elasticity aware deduplication; indexing algorithm; inline data deduplication storage systems; memory on-demand; Elasticity; Estimation; Heuristic algorithms; Indexes; Memory management; Random access memory; Servers;
Conference_Titel :
Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5062-1
DOI :
10.1109/CLOUD.2014.109