• DocumentCode
    2292021
  • Title

    Compression of low entropy strings with Lempel-Ziv algorithms

  • Author

    Kosaraju, S. Rao ; Manzini, Giovanni

  • Author_Institution
    Dept. of Comput. Sci., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    1997
  • fDate
    11-13 Jun 1997
  • Firstpage
    107
  • Lastpage
    121
  • Abstract
    We compare the compression ratio of the Lempel-Ziv algorithms with the empirical entropy of the input string, We show that although these algorithms are optimal according to the generally accepted definition, we can find families of low entropy strings which are not compressed optimally. More precisely, we show that the compression ratio achieved by LZ78 (resp. LZ77) can be much higher than the zeroth order entropy H 0 (resp. the first order entropy H1). We present a compression algorithm which combines LZ78 with run length encoding, and we show that for any string s the new algorithm achieves a compression ratio bounded by 3H0(s). Finally, we prove that for any string s the compression ratio achieved by LZ77 is bounded by 8H0 (s)
  • Keywords
    data compression; entropy; runlength codes; LZ77 algorithm; LZ78 algorithm; Lempel-Ziv algorithms; compression ratio; data compression; low entropy strings; run length encoding; Algorithm design and analysis; Compression algorithms; Computer science; Data compression; Decoding; Encoding; Entropy; Performance analysis; Probability distribution; Random sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Compression and Complexity of Sequences 1997. Proceedings
  • Conference_Location
    Salerno
  • Print_ISBN
    0-8186-8132-2
  • Type

    conf

  • DOI
    10.1109/SEQUEN.1997.666907
  • Filename
    666907