DocumentCode :
2416716
Title :
A semi-adaptive arithmetic coding scheme for Chinese textual data
Author :
Ong, Ghim Hwee ; Huang, Shell Ying
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
Volume :
2
fYear :
1993
fDate :
6-11 Sep 1993
Firstpage :
813
Abstract :
This paper presents a compression scheme for Chinese text files. Due to the skewness of the distribution of Chinese ideograms the arithmetic coding method is adopted. To reduce the overhead incurred by the frequency table in the compressed output due to the large number of Chinese ideograms, differential coding and arithmetic coding are used to produce a two-level storage structure for the frequency table. Evaluations of the proposed algorithm against several popular compression schemes show that the compression efficiency is significantly improved. This algorithm should also be applicable to other ideogram-based or oriental language texts
Keywords :
adaptive codes; arithmetic codes; data compression; data structures; word processing; Chinese ideograms; Chinese text files; algorithm; arithmetic coding method; compression efficiency; differential coding; frequency table; overhead; two-level storage structure; Arithmetic; Computer science; Data compression; Databases; Encoding; Entropy; Frequency; Information systems; Natural languages; Text processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networks, 1993. International Conference on Information Engineering '93. 'Communications and Networks for the Year 2000', Proceedings of IEEE Singapore International Conference on
Print_ISBN :
0-7803-1445-X
Type :
conf
DOI :
10.1109/SICON.1993.515700
Filename :
515700
Link To Document :
بازگشت