Title :
The redundancy and distribution of the phrase lengths of the fixed-database Lempel-Ziv algorithm
Author :
Wyner, Abraham J.
Author_Institution :
Dept. of Stat., California Univ., Berkeley, CA, USA
fDate :
9/1/1997 12:00:00 AM
Abstract :
The fixed-database version of the Lempel-Ziv algorithm closely resembles many versions that appear in practice. We ascertain several key asymptotic properties of the algorithm as applied to sources with finite memory. First, we determine that for a dictionary of size n, the algorithm achieves a redundancy ρn=Hlog log n/log n+0(log log n/log n) where H is the entropy of the process. This is the first, nontrivial, lower bound on any Lempel-Ziv-type compression scheme. We then find the limiting distribution and all moments of the lengths of the phrases by comparing them to a random-walk-like variable with well-known behavior
Keywords :
data compression; encoding; entropy; random processes; Lempel-Ziv-type compression; asymptotic properties; dictionary size; entropy; finite memory sources; fixed database Lempel-Ziv algorithm; limiting distribution; moments; nontrivial lower bound; phrase lengths distribution; phrase lengths redundancy; random walk like variable; Data compression; Dictionaries; Encoding; Entropy; Helium; Information theory; Loss measurement; Random sequences; Statistics; Upper bound;
Journal_Title :
Information Theory, IEEE Transactions on