Title :
A Parallel Architecture for In-Line Data De-duplication
Author :
Sengar, Seetendra Singh ; Mishra, Manoj
Author_Institution :
Dept. of Electron. & Comput. Eng., Indian Inst. of Technol., Roorkee, India
Abstract :
Data de-duplication, an emerging technology, has recently received broad attention from both academia and industry. Some research focuses on approaches that eliminate more redundant data, while other work investigates how to perform de-duplication at high speed. In this paper, we show the importance of data de-duplication in the current digital world and aim to reduce the time and space it requires. We then present a parallel architecture with one node designated as a server and multiple storage nodes. All nodes, including the server, can perform block-level in-line de-duplication in parallel. We have built a prototype of the system and present some performance results. The proposed system uses magnetic disks as its storage technology.
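To make the idea of block-level in-line de-duplication across several nodes concrete, the following is a minimal sketch and not the authors' implementation. The fixed 4096-byte block size, the SHA-1 hash signature, the rule that routes a block to a node by its signature modulo the node count, and the names `StorageNode`, `write_stream`, and `read_stream` are all assumptions made for illustration; the abstract does not specify any of them. The sketch also runs sequentially, whereas the paper's nodes work in parallel.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size; the abstract does not give one


class StorageNode:
    """Hypothetical node holding the unique blocks routed to it."""

    def __init__(self):
        self.blocks = {}  # hash signature -> block bytes

    def store(self, signature, block):
        """Store the block only if unseen; return True if newly written."""
        if signature in self.blocks:
            return False  # duplicate: nothing is written
        self.blocks[signature] = block
        return True


def pick_node(signature, nodes):
    """Assumed routing rule: signature value modulo the number of nodes."""
    return nodes[int(signature, 16) % len(nodes)]


def write_stream(data, nodes):
    """De-duplicate data as it arrives; return the list of signatures."""
    recipe = []
    for offset in range(0, len(data), BLOCK_SIZE):
        block = data[offset:offset + BLOCK_SIZE]
        signature = hashlib.sha1(block).hexdigest()  # assumed hash function
        pick_node(signature, nodes).store(signature, block)
        recipe.append(signature)
    return recipe


def read_stream(recipe, nodes):
    """Rebuild the original data from its list of signatures."""
    return b"".join(pick_node(sig, nodes).blocks[sig] for sig in recipe)
```

Because identical blocks always hash to the same signature, they are always routed to the same node, so each node can detect duplicates from its own index without consulting the others. That property is what would let the per-node checks run concurrently in a real deployment.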
Keywords :
data compression; parallel architectures; in-line data de-duplication; parallel architecture; redundant data; Computer architecture; Databases; Electronic mail; Industries; Java; Redundancy; Servers; cluster; data de-duplication; hash signature; in-line de-duplication; load sharing;
Conference_Title :
Advanced Computing & Communication Technologies (ACCT), 2012 Second International Conference on
Conference_Location :
Rohtak, Haryana
Print_ISBN :
978-1-4673-0471-9
DOI :
10.1109/ACCT.2012.10