DocumentCode
172978
Title
Using Elasticity to Improve Inline Data Deduplication Storage Systems
Author
Yufeng Wang ; Tan, C.C. ; Ningfang Mi
Author_Institution
Temple Univ., Philadelphia, PA, USA
fYear
2014
fDate
June 27 2014-July 2 2014
Firstpage
785
Lastpage
792
Abstract
Elasticity is the ability to scale computing resources such as memory on-demand, and is one of the main advantages of utilizing cloud computing services. With the increasing popularity of cloud based storage, it is natural that more deduplication based storage systems will be migrated to the cloud. Existing deduplication systems however, do not adequately take advantage of elasticity. In this paper, we illustrate how to use elasticity to improve deduplication based systems, and propose EAD (elasticity aware deduplication), an indexing algorithm that uses the ability to dynamically increase memory resources to improve overall deduplication performance. Our experimental results indicate that EAD is able to detect more than 98% of all duplicate data, however only consumes less than 5% of expected memory space. Meanwhile, it claims four times of deduplication efficiency than the state-of-art sampling technique while costs less than half of the amount of memory.
Keywords
cloud computing; data handling; database indexing; storage management; EAD; cloud based storage; cloud computing services; elasticity aware deduplication; indexing algorithm; inline data deduplication storage systems; memory on-demand; Elasticity; Estimation; Heuristic algorithms; Indexes; Memory management; Random access memory; Servers;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
Conference_Location
Anchorage, AK
Print_ISBN
978-1-4799-5062-1
Type
conf
DOI
10.1109/CLOUD.2014.109
Filename
6973815
Link To Document