DocumentCode
476278
Title
Disambiguating effectively Chinese polyphonic ambiguity based on unify approach
Author
Huang, Feng-Long
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli
Volume
6
fYear
2008
fDate
12-15 July 2008
Firstpage
3242
Lastpage
3246
Abstract
One of the difficult tasks on Natural Language Processing (NLP) is to resolve the sense ambiguity of characters or words on text, such as polyphones, homonymy, and homograph. The paper addresses the ambiguity issue of Chinese character polyphones and disambiguity approaches for such issues. Three methods, dictionary matching, language models and voting scheme, are used to disambiguate the prediction of polyphones. The best precision rate for these methods achieves 92.65%. Furthermore we proposed the unify approaches to improve the performance with respect to various threshold value. Comparing with the well-known MS Word 2007, our approach is superior and enhances the final precision rate up to 93.32%.
Keywords
dictionaries; natural language processing; Chinese character polyphones; Chinese polyphonic ambiguity; dictionary matching; natural language processing; Dictionaries; Frequency; Information analysis; Information retrieval; Natural language processing; Natural languages; Predictive models; Speech analysis; Speech processing; Voting; Language Model; Sense Disambiguity; Unify Approach; Voting Scheme;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2095-7
Electronic_ISBN
978-1-4244-2096-4
Type
conf
DOI
10.1109/ICMLC.2008.4620965
Filename
4620965
Link To Document