DocumentCode
1568516
Title
Dynamic Markov Compression Using a Crossbar-Like Tree Initial Structure for Chinese Texts
Author
Ong, Ghim-Hwee ; Ng, Jun-Ping
Author_Institution
Dept. of Comput. Sci., Nat. Univ. of Singapore
Volume
2
fYear
2005
Firstpage
407
Lastpage
410
Abstract
This paper proposes the use of a crossbar-like tree structure to use with dynamic Markov compression (DMC) for the compression of Chinese text files. DMC had previously been found to be more effective than common compression techniques like compress and pack and gives a compression gain of between 13.1% and 32.0%. This initial structure is able to improve on DMC´s compression results, and outperforms the various initial structures commonly adopted, such as the single-state, linear, tree or braid structures by a gain ranging from 1.5% to 9.6%
Keywords
Markov processes; data compression; natural languages; text analysis; tree data structures; Chinese text file; crossbar-like tree initial structure; dynamic Markov compression; Arithmetic; Binary trees; Cloning; Computer science; Encoding; Information technology; Predictive models; Probability distribution; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
0-7695-2316-1
Type
conf
DOI
10.1109/ICITA.2005.119
Filename
1488995
Link To Document