Title :
Compression of low entropy strings with Lempel-Ziv algorithms
Author :
Kosaraju, S. Rao ; Manzini, Giovanni
Author_Institution :
Dept. of Comput. Sci., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
We compare the compression ratio of the Lempel-Ziv algorithms with the empirical entropy of the input string. We show that although these algorithms are optimal according to the generally accepted definition, we can find families of low entropy strings which are not compressed optimally. More precisely, we show that the compression ratio achieved by LZ78 (resp. LZ77) can be much higher than the zeroth order entropy H0 (resp. the first order entropy H1). We present a compression algorithm which combines LZ78 with run length encoding, and we show that for any string s the new algorithm achieves a compression ratio bounded by 3H0(s). Finally, we prove that for any string s the compression ratio achieved by LZ77 is bounded by 8H0(s).
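The gap between LZ78 output size and zeroth order entropy described in the abstract can be illustrated with a small sketch. It is not the paper's algorithm or analysis: the function names, the textbook LZ78 parsing, and the bit-cost estimate (index bits plus a fixed number of bits per symbol) are all assumptions made for illustration. On a run of a single repeated symbol, H0 is zero while the LZ78 parsing still produces a growing number of phrases.

```python
import math
from collections import Counter


def empirical_entropy_h0(s):
    """Zeroth order empirical entropy of s, in bits per symbol."""
    n = len(s)
    if n == 0:
        return 0.0
    counts = Counter(s)
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def lz78_phrases(s):
    """Textbook LZ78 parsing: a list of (prefix_index, symbol) pairs."""
    dictionary = {"": 0}  # phrase -> index; index 0 is the empty phrase
    phrases = []
    current = ""
    for ch in s:
        if current + ch in dictionary:
            current += ch  # keep extending the longest known phrase
        else:
            phrases.append((dictionary[current], ch))
            dictionary[current + ch] = len(dictionary)
            current = ""
    if current:
        # Leftover input that is already a dictionary phrase: emit it as
        # (index of its prefix, last symbol). The prefix is always present
        # because the LZ78 dictionary is prefix-closed.
        phrases.append((dictionary[current[:-1]], current[-1]))
    return phrases


def lz78_bits_estimate(phrases, alphabet_size):
    """Illustrative cost model (an assumption, not taken from the paper):
    the k-th phrase pays ceil(log2(k)) bits for its prefix index and a
    fixed number of bits for its symbol."""
    symbol_bits = max(1, math.ceil(math.log2(alphabet_size)))
    return sum(math.ceil(math.log2(k)) + symbol_bits
               for k in range(1, len(phrases) + 1))


if __name__ == "__main__":
    s = "a" * 100  # low entropy input: a single repeated symbol
    phrases = lz78_phrases(s)
    bits = lz78_bits_estimate(phrases, alphabet_size=2)
    print("H0 (bits/symbol):", empirical_entropy_h0(s))
    print("LZ78 phrases:", len(phrases))
    print("estimated output bits per input symbol:", bits / len(s))
```

Under this cost model the estimated ratio stays strictly positive for the all-`a` input even though H0 is zero, which is the kind of behaviour that motivates combining LZ78 with run length encoding.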
Keywords :
data compression; entropy; runlength codes; LZ77 algorithm; LZ78 algorithm; Lempel-Ziv algorithms; compression ratio; low entropy strings; run length encoding; Algorithm design and analysis; Compression algorithms; Computer science; Decoding; Encoding; Performance analysis; Probability distribution; Random sequences
Conference_Title :
Compression and Complexity of Sequences 1997. Proceedings
Conference_Location :
Salerno
Print_ISBN :
0-8186-8132-2
DOI :
10.1109/SEQUEN.1997.666907