DocumentCode
2204868
Title
Hardware accelerator for similarity based data dedupe
Author
Dongyang Li ; Qingbo Wang ; Guyot, Cyril ; Narasimha, Ashwin ; Vucinic, Dejan ; Bandic, Zvonimir ; Qing Yang
Author_Institution
Dept, of Electrical, Computer, and Biomedical Engineering, Univ of Rhode Island, Kingston, 02881, USA
fYear
2015
fDate
6-7 Aug. 2015
Firstpage
224
Lastpage
232
Abstract
Data deduplication has proven important in backup storage systems as large amount of identical or similar data chunks exist. Recent studies have shown the great potential of data deduplication in primary storage and storage caches. Deduplications in these environments require high speed processing not to drag down production performance. This paper presents a hardware accelerator for similarity based data deduplication. It implements three compute-intensive kernel modules to improve throughput and latency in dedupe systems: sketch computation for data blocks, index searching for reference block, and delta encoding over similar blocks. Adopting pipelined computation and parallel data lookup across multiple hardware modules, our HW design is capable of processing high throughput data traffic by working on multiple data units concurrently, thus enabling wire speed dedupe for data stream where similar blocks present. Using a PC host system connected to the FPGA-based accelerator through a PCIe Gen 2×4 interface, our experiments show that the similarity based data dedupe performs 30% better in data reduction ratio than conventional dedupe techniques that look at identical blocks only. By comparing the hardware implementation with its software counterpart, the experimental results show that our preliminary FPGA implementation with maximum clock speed of 250MHz achieves at least 6 times improvement in latency over the software implementation running on state-of-art servers.
Keywords
Cache; Data deduplication; Fingerprinting; PCIe; multiple lookup dictionary;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on
Conference_Location
Boston, MA, USA
Type
conf
DOI
10.1109/NAS.2015.7255198
Filename
7255198
Link To Document