• DocumentCode
    2053374
  • Title

    A hybrid approach to automatic Chinese text checking and error correction

  • Author

    Ren, Fuji ; Shi, Hongchi ; Zhou, Qiang

  • Author_Institution
    Fac. of Eng., Tokushima Univ., Japan
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    1693
  • Abstract
    Automatic Chinese text checking and error correction is an important and difficult problem. Compared with automatic checking and error correction of Western text automatic checking and error correction of Chinese text faces more challenges. The Chinese language has many characters and no delimiters separating words. It is impossible to detect. and correct errors by penetrating into the inner composition of a character. In this paper, we describe some special features of Chinese characters and text and some statistical information obtained from a real world Chinese text corpus, and we present a hybrid approach that combines a rule-based method and a probability-based method to automatic checking and error correction of Chinese text. We also present an experimental system, HSACCCT (Hybrid System of Automatic Checking and Correction for Chinese Text), that implements this hybrid approach and some experimental results on real world Chinese text
  • Keywords
    character recognition; natural languages; text analysis; Chinese characters; Chinese language; Chinese text checking; Chinese text corpus; HSACCCT; error correction; probability-based method; rule-based method; Code standards; Computer science; Dictionaries; Error correction; Error probability; Face detection; Morphology; Natural languages; Neural networks; Text processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2001 IEEE International Conference on
  • Conference_Location
    Tucson, AZ
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7087-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2001.973529
  • Filename
    973529