Title :
Data compression using antidictionaries
Author :
Crochemore, Maxime ; Mignosi, Filippo ; Restivo, Antonio ; Salemi, Sergio
Author_Institution :
Inst. Gaspard-Monge, Univ. de Marne-la-Vallee, France
Abstract :
We give a new text-compression scheme based on forbidden words ("antidictionary"). We prove that our algorithms attain the entropy for balanced binary sources. They run in linear time. Moreover, one of the main advantages of this approach is that it produces very fast decompressors. A second advantage is a synchronization property that is helpful to search compressed data and allows parallel compression. The techniques used in this paper are from information theory and finite automata.
Keywords :
data compression; entropy; finite automata; information theory; pattern matching; synchronisation; antidictionaries; balanced binary sources; data compression; decompressors; entropy; finite automata; forbidden words; information theory; linear time; parallel compression; synchronization property; text-compression scheme; Automata; Data compression; Decoding; Dictionaries; Entropy; Humans; Information theory; Pattern matching; Psychology; Read only memory;
Journal_Title :
Proceedings of the IEEE