Title :
On the pointwise redundancy of the LZ78 algorithm
Author :
Yang, En-Hui ; Song, Lihua ; Shamir, Gil I. ; Kieffer, John C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont.
Abstract :
The redundancy rate of the Lempel-Ziv algorithm has been widely investigated. Much of the data compression community believed that the redundancy rate of the LZ78 algorithm should be O((log n)-1), where n is the data length. However, until the present paper, this conjecture had not been proved for sources beyond Markov sources. In this paper, we investigate the upper bound on the pointwise redundancy rate of the Lempel-Ziv algorithm for mixing sources and finite-state sources. The technique we applied in this paper is simple. By studying the dictionary tree resulting from the LZ78 algorithm, we derive certain relationships between the self-information of a sequence emitted by a source and the number of phrases resulting from the LZ78 parsing of the sequence. From these relationships, upper bounds on the pointwise redundancy rate of the LZ78 algorithm on mixing sources and finite-state sources can be obtained. These results show that for mixing sources and finite-state sources, the pointwise redundancy rate is upper bounded by O((log n)-1) for the LZ78 algorithm. We also compare our results with previous results of Savari and Kieffer-Yang
Keywords :
computational complexity; data compression; finite state machines; Lempel-Ziv algorithm; data compression; data length; finite-state sources; mixing sources; pointwise redundancy rate; Collaborative work; Councils; Data compression; Dictionaries; Encoding; Entropy; Gas insulated transmission lines; Information technology; Research and development; Upper bound;
Conference_Titel :
Information Theory, 2005. ISIT 2005. Proceedings. International Symposium on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-9151-9
DOI :
10.1109/ISIT.2005.1523384