• DocumentCode
    1568516
  • Title

    Dynamic Markov Compression Using a Crossbar-Like Tree Initial Structure for Chinese Texts

  • Author

    Ong, Ghim-Hwee ; Ng, Jun-Ping

  • Author_Institution
    Dept. of Comput. Sci., Nat. Univ. of Singapore
  • Volume
    2
  • fYear
    2005
  • Firstpage
    407
  • Lastpage
    410
  • Abstract
    This paper proposes the use of a crossbar-like tree structure to use with dynamic Markov compression (DMC) for the compression of Chinese text files. DMC had previously been found to be more effective than common compression techniques like compress and pack and gives a compression gain of between 13.1% and 32.0%. This initial structure is able to improve on DMC´s compression results, and outperforms the various initial structures commonly adopted, such as the single-state, linear, tree or braid structures by a gain ranging from 1.5% to 9.6%
  • Keywords
    Markov processes; data compression; natural languages; text analysis; tree data structures; Chinese text file; crossbar-like tree initial structure; dynamic Markov compression; Arithmetic; Binary trees; Cloning; Computer science; Encoding; Information technology; Predictive models; Probability distribution; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    0-7695-2316-1
  • Type

    conf

  • DOI
    10.1109/ICITA.2005.119
  • Filename
    1488995