Title :
Disambiguating effectively Chinese polyphonic ambiguity based on unify approach
Author :
Huang, Feng-Long
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli
Abstract :
One of the difficult tasks on Natural Language Processing (NLP) is to resolve the sense ambiguity of characters or words on text, such as polyphones, homonymy, and homograph. The paper addresses the ambiguity issue of Chinese character polyphones and disambiguity approaches for such issues. Three methods, dictionary matching, language models and voting scheme, are used to disambiguate the prediction of polyphones. The best precision rate for these methods achieves 92.65%. Furthermore we proposed the unify approaches to improve the performance with respect to various threshold value. Comparing with the well-known MS Word 2007, our approach is superior and enhances the final precision rate up to 93.32%.
Keywords :
dictionaries; natural language processing; Chinese character polyphones; Chinese polyphonic ambiguity; dictionary matching; natural language processing; Dictionaries; Frequency; Information analysis; Information retrieval; Natural language processing; Natural languages; Predictive models; Speech analysis; Speech processing; Voting; Language Model; Sense Disambiguity; Unify Approach; Voting Scheme;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4620965