DocumentCode
2945466
Title
On the Use of Stronger Synchronization to Boost Compression by Substring Enumeration
Author
Dubé, Danny
fYear
2011
fDate
29-31 March 2011
Firstpage
454
Lastpage
454
Abstract
A new lossless data compression technique called compression by substring enumeration (CSE) has recently been introduced. CSE is competitive but it achieves lower performance on non-text-like data. More recent work confirmed that CSE incurs a penalty due to the fact that it is unaware of the position (or phase) of the bits relative to the byte boundaries. That work demonstrated that CSE can be boosted by adding a preprocessing step in which synchronization bits are inserted in the data. Various synchronization schemes were used and, in general, it has been observed that the more we insert bits, the more we improve the compression, with the best results obtained using a reliable scheme that inserts 5 bits per byte (n = 13). In this work, the authors measure the boost when using even stronger schemes. The results are negative: the use of stronger schemes (n <; 13) brings only minimal improvements.
Keywords
data compression; synchronisation; compression by substring enumeration; lossless data compression technique; strong synchronization; synchronization bits; Data compression; Shape; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2011
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-61284-279-0
Type
conf
DOI
10.1109/DCC.2011.58
Filename
5749511
Link To Document