DocumentCode
653511
Title
Data Deduplication Cluster Based on Similarity-Locality Approach
Author
Xingyu Zhang ; Jian Zhang
Author_Institution
Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
fYear
2013
fDate
20-23 Aug. 2013
Firstpage
2168
Lastpage
2172
Abstract
Human beings have entered the big data era, and the growing data bring huge challenges for data storage. Existing deduplication methods do not work adequately in many situations. Recently, the data deduplication cluster has become an important need of most commercial and research backup systems. Data deduplication cluster becomes popular in storage system for data backup and archiving. Many researchers focus on deduplication cluster by which to reduce more redundant data. Especially block level deduplication cluster becomes popular. It is concerned to have two challenges: the chunk-lookup disk bottleneck problem and the data routing problem. A new solution is proposed for chunk-lookup disk bottleneck in our paper. The Approach of combining similarity with locality is applied to the deduplication cluster. At the same time, the bloom filter algorithm storing fingerprint is used to find more duplicate data between nodes. The system architecture and the details are provided. Finally, the experiment shows the system has a good performance.
Keywords
Big Data; data compression; data structures; storage management; big data era; block level deduplication cluster; bloom filter algorithm; chunk-lookup disk bottleneck problem; data archiving; data backup systems; data deduplication cluster; data routing problem; data storage; similarity-locality approach; Filtering algorithms; Fingerprint recognition; Information filters; Information management; Servers; Throughput; bloom filter; cluster; deduplication; locality; similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Green Computing and Communications (GreenCom), 2013 IEEE and Internet of Things (iThings/CPSCom), IEEE International Conference on and IEEE Cyber, Physical and Social Computing
Conference_Location
Beijing
Type
conf
DOI
10.1109/GreenCom-iThings-CPSCom.2013.409
Filename
6682419
Link To Document