DocumentCode
535934
Title
Research and Implementation on a Hybrid Algorithm for Chinese Automatic Error-detecting
Author
Jinjin, Zhu ; Yangsen, Zhang
Author_Institution
Inst. of Intell. Inf. Process., Univ. Beijing, Beijing, China
Volume
1
fYear
2010
fDate
23-24 Oct. 2010
Firstpage
413
Lastpage
417
Abstract
The automatic error-detecting system is implemented on a hybrid algorithm combining with n-gram model, dependency parsing, Hownet and rules created and used to detect different error types of Chinese text. First of all, four different n-gram models are employed to analyze the separate strings in the segmented texts and detect the lexical errors, and experiments are made on the frequency statistics of words and characters. Subsequently, dependency parsing and Hownet are introduced into automatic proofreading and help to detect semantic errors. Dependency grammar parses the whole sentence and denotes dominating and dominated relation among the words, combined with Hownet can efficiently check the semantic errors. In addition, grammatical collocation rules are made to check the syntax errors, in order to fill up the deficiency of the two methods above. Finally an ideal automatically detecting error system is obtained with precision of 69.66% and recall of 84.16%.
Keywords
error detection; natural language processing; text analysis; chinese automatic error detecting; frequency statistics; grammatical collocation rules; hybrid algorithm; lexical error detection; n-gram model; semantic errors; syntax errors; text segmentation; Accuracy; Analytical models; Error analysis; Grammar; Information processing; Semantics; Syntactics; Hownet; dependency parsing; error-detecting; lexical; n-gram; semantic; separate string; syntax;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location
Sanya
Print_ISBN
978-1-4244-8432-4
Type
conf
DOI
10.1109/AICI.2010.93
Filename
5655638
Link To Document