DocumentCode
3583804
Title
A disambiguate method to covering ambiguity based on the collocation information
Author
Feng, Su-Qin ; Jiao, Li-juan
Author_Institution
Dept. of Comput. Sci., XinZhou Teachers Univ., Xinzhou, China
Volume
7
fYear
2010
Firstpage
3669
Lastpage
3672
Abstract
Covering ambiguity is a vital issue in Chinese word segmentation. This paper presents disambiguation strategies based on collocation information. First, words that exhibit combinatorial ambiguity are extracted from a large-scale corpus and their collocation information is counted. A multi maximal log algorithm is then used for disambiguation. The disambiguated corpus is further used to strengthen and stabilize the collocations. Experiments show the method to be simple and effective.
Keywords
natural language processing; Chinese word segmentation; collocation information; combinatorial ambiguity; disambiguate method; multimaximal log algorithm; Accuracy; Algorithm design and analysis; Computers; Context; Heuristic algorithms; Sun; Training; Chinese word segmentation; Collocation information; Covering ambiguities; Disambiguate; multi maximal log;
fLanguage
English
Publisher
IEEE
Conference_Titel
Natural Computation (ICNC), 2010 Sixth International Conference on
Print_ISBN
978-1-4244-5958-2
Type
conf
DOI
10.1109/ICNC.2010.5583733
Filename
5583733