DocumentCode :
2945195
Title :
Tree Structure Compression with RePair
Author :
Lohrey, Markus ; Maneth, Sebastian ; Mennicke, Roy
Author_Institution :
Univ. Leipzig, Leipzig, Germany
fYear :
2011
fDate :
29-31 March 2011
Firstpage :
353
Lastpage :
362
Abstract :
Larsson and Moffat's RePair algorithm is generalized from strings to trees. The new algorithm (TreeRePair) produces straight-line linear context-free tree (SLT) grammars which are smaller than those produced by previous grammar-based compressors such as BPLEX. Experiments show that a Huffman-based coding of the resulting grammars gives compression ratios comparable to the best known XML file compressors. Moreover, SLT grammars can be used as an efficient memory representation of trees. Our investigations show that tree traversals over TreeRePair grammars are 14 times slower than over pointer structures and 5 times slower than over succinct trees, while memory consumption is only 1/43 and 1/6, respectively.
Keywords :
Huffman codes; context-free grammars; data compression; tree data structures; Huffman-based coding; Larsson RePair algorithm; Moffat RePair algorithm; TreeRePair algorithm; XML file compressors; grammar-based compressors; straight-line linear context-free tree grammars; tree structure compression; Compressors; Encoding; Grammar; Maintenance engineering; Production; Vegetation; XML; XML compression; grammar-based compression; tree compression
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Data Compression Conference (DCC), 2011
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
978-1-61284-279-0
Type :
conf
DOI :
10.1109/DCC.2011.42
Filename :
5749493