Title :
A hybrid approach to automatic Chinese text checking and error correction
Author :
Ren, Fuji ; Shi, Hongchi ; Zhou, Qiang
Author_Institution :
Fac. of Eng., Tokushima Univ., Japan
Abstract :
Automatic Chinese text checking and error correction is an important and difficult problem. Compared with automatic checking and error correction of Western text, automatic checking and error correction of Chinese text faces more challenges. The Chinese language has many characters and no delimiters separating words. It is impossible to detect and correct errors by penetrating into the inner composition of a character. In this paper, we describe some special features of Chinese characters and text and some statistical information obtained from a real-world Chinese text corpus, and we present a hybrid approach that combines a rule-based method and a probability-based method for automatic checking and error correction of Chinese text. We also present an experimental system, HSACCCT (Hybrid System of Automatic Checking and Correction for Chinese Text), that implements this hybrid approach, and we report some experimental results on real-world Chinese text.
Keywords :
character recognition; natural languages; text analysis; Chinese characters; Chinese language; Chinese text checking; Chinese text corpus; HSACCCT; error correction; probability-based method; rule-based method; Code standards; Computer science; Dictionaries; Error correction; Error probability; Face detection; Morphology; Natural languages; Neural networks; Text processing;
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-7087-2
DOI :
10.1109/ICSMC.2001.973529