• DocumentCode
    1160994
  • Title

    Upper bounds on the probability of sequences emitted by finite-state sources and on the redundancy of the Lempel-Ziv algorithm

  • Author

    Plotnik, Eli ; Weinberger, Marcelo J. ; Ziv, Jacob

  • Author_Institution
    Dept. of Electr. Eng., Technion, Haifa, Israel Inst. of Technol., Israel
  • Volume
    38
  • Issue
    1
  • fYear
    1992
  • fDate
    1/1/1992 12:00:00 AM
  • Firstpage
    66
  • Lastpage
    72
  • Abstract
    An upper bound on the probability of a sequence drawn from a finite-state source is derived. The bound is given in terms of the number of phrases obtained by parsing the sequence according to the Lempel-Ziv (L-Z) incremental parsing rule, and is universal in the sense that it does not depend on the statistical parameters that characterize the source. This bound is used to derive an upper bound on the redundance of the L-Z universal data compression algorithm applied to finite-state sources, that depends on the length N of the sequence, on the number K of states of the source, and, eventually, on the source entropy. A variation of the L-Z algorithm is presented, and an upper bound on its redundancy is derived for finite-state sources. A method to derive tighter implicit upper bounds on the redundancy of both algorithms is also given, and it is shown that for the proposed variation this bound is smaller than for the original L-Z algorithm, or every value of N and K
  • Keywords
    data compression; encoding; information theory; redundancy; Lempel-Ziv algorithm; encoding; finite-state sources; incremental parsing rule; parsing; probability of sequences; redundancy; source entropy; universal data compression; upper bound; Binary sequences; Cities and towns; Data compression; Entropy; Information theory; Jacobian matrices; Probability; Upper bound;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.108250
  • Filename
    108250