DocumentCode :
2684781
Title :
Offline dictionary-based compression
Author :
Larsson, N. Jesper ; Moffat, Alistair
Author_Institution :
Dept. of Comput. Sci., Lund Univ., Sweden
fYear :
1999
fDate :
29-31 Mar 1999
Firstpage :
296
Lastpage :
305
Abstract :
Dictionary-based modelling is the mechanism used in many practical compression schemes. We use the full message (or a large block of it) to infer a complete dictionary in advance, and include an explicit representation of the dictionary as part of the compressed message. Intuitively, the advantage of this offline approach is that with the benefit of having access to all of the message, it should be possible to optimize the choice of phrases so as to maximize compression performance. Indeed, we demonstrate that very good compression can be attained by an offline method without compromising the fast decoding that is a distinguishing characteristic of dictionary-based techniques. Several nontrivial sources of overhead, in terms of both computation resources required to perform the compression, and bits generated into the compressed message, have to be carefully managed as part of the offline process. To meet this challenge, we have developed a novel phrase derivation method and a compact dictionary encoding. In combination these two techniques produce the compression scheme RE-PAIR, which is highly efficient, particularly in decompression
Keywords :
data compression; decoding; encoding; optimisation; text analysis; RE-PAIR; compact dictionary encoding; decoding; decompression; dictionary-based modelling; maximization; message; offline compression; optimization; phrase derivation method; Decoding; Dictionaries; Encoding; Entropy; Frequency; Proposals; Resource management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1999. Proceedings. DCC '99
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
0-7695-0096-X
Type :
conf
DOI :
10.1109/DCC.1999.755679
Filename :
755679
Link To Document :
بازگشت