DocumentCode :
2189513
Title :
High-Order Text Compression on Hierarchical Edge-Guided
Author :
Martinez-Prieto, M.A. ; Adiego, Joaquín ; Fuente, P. ; Fernandez, Javier D
Author_Institution :
Depto. de Inf., Univ. de Valladolid, Valladolid, Spain
fYear :
2010
fDate :
24-26 March 2010
Firstpage :
543
Lastpage :
543
Abstract :
Summary form only given.The hierarchical Edge-Guided techniques (called E-Gfc) enhance the original E-G approach to support high-order text statistics. These consider the same graph-based model to represent an extended input alphabet obtained by using a variant of the Re-Pair algorithm. E-Gfc adapts the previous coding scheme to grasp the features of the bit-oriented canonical Huffman code chosen as output alphabet.
Keywords :
Huffman codes; graph theory; higher order statistics; text analysis; word processing; bit-oriented canonical Huffman code; graph-based model; hierarchical edge-guided techniques; high-order text compression; high-order text statistics; input alphabet; output alphabet; re-pair algorithm; Compressors; Context modeling; Data compression; Encoding; Government; Joining processes; Natural languages; Statistics; Vocabulary; Edge-Guided; High-Order Word-based Model; Re-Pair; Text Compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference (DCC), 2010
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
978-1-4244-6425-8
Electronic_ISBN :
1068-0314
Type :
conf
DOI :
10.1109/DCC.2010.72
Filename :
5453496
Link To Document :
بازگشت