Title :
Accelerating duplicate data chunk recognition using NN trained by locality-sensitive hash
Author :
Berman, Amit ; Birk, Yitzhak ; Mendelson, Avi
Author_Institution :
Electr. Eng. Dept., Technion - Israel Inst. of Technol., Haifa, Israel
Abstract :
Deduplication is often used in storage systems in order to save storage space, communication bandwidth, write energy, and recovery and error-protection infrastructure. However, deduplication overhead increases latency and computation energy. Determining whether a data chunk is already stored by comparing signatures constitutes a significant fraction of this deduplication overhead. In this paper, we propose a statistical chunk classifier based on a neural network. Our technique is based on learning the patterns of locality-sensitive hashing of the data. Our experiments show an acceleration of chunk processing, leading to reduction in deduplication overhead.
Keywords :
file organisation; neural nets; pattern classification; NN; communication bandwidth; computation energy; deduplication overhead; duplicate data chunk recognition; error-protection infrastructure; locality-sensitive hash; locality-sensitive hashing; neural network; statistical chunk classifier; storage systems; write energy; Acceleration; Artificial neural networks; Biological neural networks; Computer architecture; Neurons; Training; Chunking; Cloud Storage; Deduplication; Locality-Sensitive Hashing; Machine Learning; Neural Network;
Conference_Titel :
Electrical & Electronics Engineers in Israel (IEEEI), 2014 IEEE 28th Convention of
Conference_Location :
Eilat
Print_ISBN :
978-1-4799-5987-7
DOI :
10.1109/EEEI.2014.7005887