DocumentCode :
2743192
Title :
An approximation to the greedy algorithm for differential compression of very large files
Author :
Agarwal, Ramesh C. ; Amalapurapu, Suchitra ; Jain, Shaili
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
fYear :
2004
fDate :
23-25 March 2004
Firstpage :
523
Abstract :
This paper presents a new differential compression algorithm that combines the hash value and suffix array technique. In this algorithm, hash values for every block of the reference file is computed. Next, suffix arrays on these block hash values are computed. This algorithm finds the longest matches for every offset of the version file. This algorithm depends upon the utilization of three new data structures, the block hash table, the quick index array, and the pointer array, which improves the run-time of the algorithm, and compress very large files.
Keywords :
algorithm theory; block codes; data compression; data structures; file organisation; algorithm run-time; block hash table; data structure; differential compression algorithm; greedy algorithm; hash value; large file compression; pointer array; quick index array; reference file computation; suffix array technique; version file matching; Compression algorithms; Computational biology; Data compression; Data structures; Encoding; Greedy algorithms; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 2004. Proceedings. DCC 2004
ISSN :
1068-0314
Print_ISBN :
0-7695-2082-0
Type :
conf
DOI :
10.1109/DCC.2004.1281499
Filename :
1281499
Link To Document :
بازگشت