• DocumentCode
    2945466
  • Title

    On the Use of Stronger Synchronization to Boost Compression by Substring Enumeration

  • Author

    Dubé, Danny

  • fYear
    2011
  • fDate
    29-31 March 2011
  • Firstpage
    454
  • Lastpage
    454
  • Abstract
    A new lossless data compression technique called compression by substring enumeration (CSE) has recently been introduced. CSE is competitive, but it performs worse on non-text-like data. More recent work confirmed that CSE incurs a penalty because it is unaware of the position (or phase) of the bits relative to the byte boundaries. That work demonstrated that CSE can be boosted by adding a preprocessing step in which synchronization bits are inserted into the data. Various synchronization schemes were used and, in general, it was observed that the more bits are inserted, the more the compression improves, with the best results obtained using a reliable scheme that inserts 5 bits per byte (n = 13). In this work, the authors measure the boost when using even stronger schemes. The results are negative: the use of stronger schemes (n > 13) brings only minimal improvements.
  • Keywords
    data compression; synchronisation; compression by substring enumeration; lossless data compression technique; strong synchronization; synchronization bits; Data compression; Shape; Synchronization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Compression Conference (DCC), 2011
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    978-1-61284-279-0
  • Type

    conf

  • DOI
    10.1109/DCC.2011.58
  • Filename
    5749511