• DocumentCode
    3583804
  • Title

    A disambiguate method to covering ambiguity based on the collocation information

  • Author

    Feng, Su-Qin ; Jiao, Li-juan

  • Author_Institution
    Dept. of Comput. Sci., XinZhou Teachers Univ., Xinzhou, China
  • Volume
    7
  • fYear
    2010
  • Firstpage
    3669
  • Lastpage
    3672
  • Abstract
    Covering ambiguity is a vital issue in Chinese word segmentation. The paper presents the disambiguation strategies based on the collocation information. Firstly, it gets the word that is combinatorial ambiguities from a larger scaled corpus, then counts up it´s collocation information. Lastly it uses multi maximal log algorithm for disambiguation. Further the paper uses disambiguated corpus to strengthen and stabilize collocations It is proved to be an easy and effective way in the experiments.
  • Keywords
    natural language processing; Chinese word segmentation; collocation information; combinatorial ambiguity; disambiguate method; multimaximal log algorithm; Accuracy; Algorithm design and analysis; Computers; Context; Heuristic algorithms; Sun; Training; Chinese word segmentation; Collocation information; Covering ambiguities; Disambiguate; multi maximal log;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2010 Sixth International Conference on
  • Print_ISBN
    978-1-4244-5958-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2010.5583733
  • Filename
    5583733