• DocumentCode
    517686
  • Title

    Adaptive Chinese Combinatorial Ambiguities Disambiguate Method

  • Author

    Feng, Su-Qin ; Feng, Li-Peng

  • Author_Institution
    Dept. of Comput. Sci., XinZhou Teachers Univ., Xinzhou, China
  • Volume
    1
  • fYear
    2010
  • fDate
    24-25 April 2010
  • Firstpage
    168
  • Lastpage
    171
  • Abstract
    Combinatorial ambiguity has always been a vital issue in Chinese word segmentation. This paper presents a novel disambiguation approach that uses a multi-maximal log-likelihood ratio over a cooperative statistical table, taking the cooperative examples provided by manually checked word segmentation as the initial cooperative knowledge of covering ambiguity. The key factors are then determined from experiments on combinatorial ambiguities (the size of the context window, the sensitivity to position within the window, the weighting of feature words, etc.). On this basis, an adaptive Chinese combinatorial ambiguity disambiguation method is used to strengthen and stabilize collocations. Finally, the method is tested and shown to improve accuracy.
  • Keywords
    artificial intelligence; natural language processing; statistical analysis; Chinese word segmentation; adaptive Chinese combinatorial ambiguities disambiguate method; artificial checked word segmentation; cooperative statistical table; multimaximal log likelihood ratio; Computer networks; Computer science; Computer security; Information processing; Large-scale systems; Natural languages; Statistical analysis; Statistics; Testing; Wireless communication; adaptive method; combinatorial ambiguities; disambiguate
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Networks Security Wireless Communications and Trusted Computing (NSWCTC), 2010 Second International Conference on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-4011-5
  • Electronic_ISBN
    978-1-4244-6598-9
  • Type

    conf

  • DOI
    10.1109/NSWCTC.2010.46
  • Filename
    5480531