• DocumentCode
    2398519
  • Title

    Multiple-dictionary compression using partial matching

  • Author

    Hoang, Dzung T. ; Long, Philip M. ; Vitter, Jeffrey Scott

  • Author_Institution
    Dept. of Comput. Sci., Duke Univ., Durham, NC, USA
  • fYear
    1995
  • fDate
    28-30 Mar 1995
  • Firstpage
    272
  • Lastpage
    281
  • Abstract
    Motivated by the desire to find text compressors that compress better than existing dictionary methods, but run faster than PPM implementations, we describe methods for text compression using multiple dictionaries, one for each context of preceding characters, where the contexts have varying lengths. The context to be used is determined using an escape mechanism similar to that of PPM methods. We describe modifications of three popular dictionary coders along these lines and experiments evaluating their efficacy using the text files in the Calgary corpus. Our results suggest that modifying LZ77 along these lines yields an improvement in compression of about 4%, that modifying LZFG yields a compression improvement of about 8%, and that modifying LZW in this manner yields an average improvement on the order of 12%
  • Keywords
    data compression; encoding; Calgary corpus; escape mechanism; multiple dictionaries; multiple-dictionary compression; partial matching; text compressors; text files; Arithmetic; Compressors; Computer science; Dictionaries; Encoding; Entropy; Probability distribution; Statistical analysis; Statistics; Yield estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 1995. DCC '95. Proceedings
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    0-8186-7012-6
  • Type

    conf

  • DOI
    10.1109/DCC.1995.515517
  • Filename
    515517