• DocumentCode
    1809002
  • Title

    An Improved Chinese Segmentation Algorithm Based on New Dictionary Construction

  • Author

    Niu, Yan ; Li, Lala

  • Author_Institution
    Comput. Coll., Hubei Univ. of Technol., Wuhan, China
  • Volume
    2
  • fYear
    2009
  • fDate
    29-31 Aug. 2009
  • Firstpage
    993
  • Lastpage
    996
  • Abstract
    In this paper, we use the results of word frequency statistics to design a new dictionary construction and, based on an analysis of the principle and characteristics of the traditional forward maximum matching (FMM) algorithm, propose an improved FMM algorithm. Time complexity analysis and experimental comparison show that the improved FMM algorithm further improves the efficiency of Chinese word segmentation.
  • Keywords
    dictionaries; natural language processing; statistical analysis; Chinese segmentation algorithm; dictionary construction; forward maximum matching algorithm; word frequency statistics; Algorithm design and analysis; Design engineering; Dictionaries; Educational institutions; Frequency; Handicapped aids; Information processing; Natural languages; Statistical analysis; Vocabulary; Chinese Segmentation; Chinese word frequency; FMM algorithm; segmentation dictionary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Science and Engineering, 2009. CSE '09. International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4244-5334-4
  • Electronic_ISBN
    978-0-7695-3823-5
  • Type

    conf

  • DOI
    10.1109/CSE.2009.269
  • Filename
    5283472