DocumentCode :
476278
Title :
Disambiguating effectively Chinese polyphonic ambiguity based on unify approach
Author :
Huang, Feng-Long
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli
Volume :
6
fYear :
2008
fDate :
12-15 July 2008
Firstpage :
3242
Lastpage :
3246
Abstract :
One of the difficult tasks on Natural Language Processing (NLP) is to resolve the sense ambiguity of characters or words on text, such as polyphones, homonymy, and homograph. The paper addresses the ambiguity issue of Chinese character polyphones and disambiguity approaches for such issues. Three methods, dictionary matching, language models and voting scheme, are used to disambiguate the prediction of polyphones. The best precision rate for these methods achieves 92.65%. Furthermore we proposed the unify approaches to improve the performance with respect to various threshold value. Comparing with the well-known MS Word 2007, our approach is superior and enhances the final precision rate up to 93.32%.
Keywords :
dictionaries; natural language processing; Chinese character polyphones; Chinese polyphonic ambiguity; dictionary matching; natural language processing; Dictionaries; Frequency; Information analysis; Information retrieval; Natural language processing; Natural languages; Predictive models; Speech analysis; Speech processing; Voting; Language Model; Sense Disambiguity; Unify Approach; Voting Scheme;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
Type :
conf
DOI :
10.1109/ICMLC.2008.4620965
Filename :
4620965
Link To Document :
بازگشت