• DocumentCode
    476278
  • Title
    Disambiguating effectively Chinese polyphonic ambiguity based on unify approach
  • Author
    Huang, Feng-Long
  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli
  • Volume
    6
  • fYear
    2008
  • fDate
    12-15 July 2008
  • Firstpage
    3242
  • Lastpage
    3246
  • Abstract
    One of the difficult tasks in Natural Language Processing (NLP) is resolving the sense ambiguity of characters or words in text, such as polyphones, homonyms, and homographs. This paper addresses the ambiguity of Chinese polyphonic characters and approaches to disambiguating them. Three methods, dictionary matching, language models, and a voting scheme, are used to disambiguate polyphones. The best precision rate achieved by these methods is 92.65%. Furthermore, we propose a unified approach that improves performance with respect to various threshold values. Compared with the well-known MS Word 2007, our approach is superior and raises the final precision rate to 93.32%.
  • Keywords
    dictionaries; natural language processing; Chinese character polyphones; Chinese polyphonic ambiguity; dictionary matching; Frequency; Information analysis; Information retrieval; Natural languages; Predictive models; Speech analysis; Speech processing; Voting; Language Model; Sense Disambiguity; Unify Approach; Voting Scheme
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2008 International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2095-7
  • Electronic_ISBN
    978-1-4244-2096-4
  • Type
    conf
  • DOI
    10.1109/ICMLC.2008.4620965
  • Filename
    4620965