• DocumentCode
    535934
  • Title

    Research and Implementation on a Hybrid Algorithm for Chinese Automatic Error-detecting

  • Author

    Jinjin, Zhu ; Yangsen, Zhang

  • Author_Institution
    Inst. of Intell. Inf. Process., Univ. Beijing, Beijing, China
  • Volume
    1
  • fYear
    2010
  • fDate
    23-24 Oct. 2010
  • Firstpage
    413
  • Lastpage
    417
  • Abstract
    The automatic error-detecting system is implemented on a hybrid algorithm combining with n-gram model, dependency parsing, Hownet and rules created and used to detect different error types of Chinese text. First of all, four different n-gram models are employed to analyze the separate strings in the segmented texts and detect the lexical errors, and experiments are made on the frequency statistics of words and characters. Subsequently, dependency parsing and Hownet are introduced into automatic proofreading and help to detect semantic errors. Dependency grammar parses the whole sentence and denotes dominating and dominated relation among the words, combined with Hownet can efficiently check the semantic errors. In addition, grammatical collocation rules are made to check the syntax errors, in order to fill up the deficiency of the two methods above. Finally an ideal automatically detecting error system is obtained with precision of 69.66% and recall of 84.16%.
  • Keywords
    error detection; natural language processing; text analysis; chinese automatic error detecting; frequency statistics; grammatical collocation rules; hybrid algorithm; lexical error detection; n-gram model; semantic errors; syntax errors; text segmentation; Accuracy; Analytical models; Error analysis; Grammar; Information processing; Semantics; Syntactics; Hownet; dependency parsing; error-detecting; lexical; n-gram; semantic; separate string; syntax;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-8432-4
  • Type

    conf

  • DOI
    10.1109/AICI.2010.93
  • Filename
    5655638