  • DocumentCode
    3300511
  • Title
    Using synchronization bits to boost compression by substring enumeration
  • Author
    Dubé, Danny
  • Author_Institution
    Univ. Laval, Quebec City, QC, Canada
  • fYear
    2010
  • fDate
    17-20 Oct. 2010
  • Firstpage
    82
  • Lastpage
    87
  • Abstract
    A new lossless data compression technique called compression via substring enumeration (CSE) has recently been introduced. It has been observed that CSE achieves lower performance on binary data. A hypothesis has been formulated that CSE loses track of the position of the bits relative to the byte boundaries more easily in binary data and that this confusion incurs a penalty for CSE. This paper questions the validity of the hypothesis and proposes a simple technique to reduce the penalty, in case the hypothesis is correct. The technique consists of adding a preprocessing step that inserts synchronization bits in the data in order to boost the performance of CSE. Experiments provide strong evidence that the formulated hypothesis is true, and they demonstrate the effectiveness of synchronization bits.
  • Keywords
    data compression; synchronisation; binary data; byte boundaries; compression by substring enumeration; lossless data compression technique; preprocessing step; synchronization bits; Benchmark testing; Data compression; Data structures; Lead; Phase measurement; Synchronization; Transforms
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Theory and its Applications (ISITA), 2010 International Symposium on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-4244-6016-8
  • Electronic_ISBN
    978-1-4244-6017-5
  • Type
    conf
  • DOI
    10.1109/ISITA.2010.5649565
  • Filename
    5649565