DocumentCode :
1254101
Title :
On the performance of data compression algorithms based upon string matching
Author :
Yang, En-Hui ; Kieffer, John C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Volume :
44
Issue :
1
fYear :
1998
fDate :
1/1/1998
Firstpage :
47
Lastpage :
65
Abstract :
Lossless and lossy data compression algorithms based on string matching are considered. In the lossless case, a result of Wyner and Ziv (1989) is extended. In the lossy case, a data compression algorithm based on approximate string matching is analyzed in the following two frameworks: (1) the database and the source together form a Markov chain of finite order; (2) the database and the source are independent, with the database coming from a Markov model and the source from a general stationary, ergodic model. In either framework, it is shown that the resulting compression rate converges with probability one to a quantity computable as the infimum of an information-theoretic functional over a set of auxiliary random variables; the quantity is strictly greater than the rate distortion function of the source except in some symmetric cases. In particular, this result implies that the lossy algorithm proposed by Steinberg and Gutman (1993) is not optimal, even for memoryless or Markov sources.
Keywords :
Markov processes; convergence of numerical methods; data compression; functional analysis; rate distortion theory; source coding; string matching; Markov model; Markov sources; approximate string matching; auxiliary random variables; compression rate convergence; database; ergodic model; finite order Markov chain; information theoretic functional; lossless data compression algorithms; lossy data compression algorithms; memoryless sources; rate distortion function; source coding; stationary model; symmetric cases; Algorithm design and analysis; Communication system control; Data compression; Databases; Decoding; Encoding; Information theory; Random variables; Rate-distortion; Terrorism
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.650987
Filename :
650987