DocumentCode
1809002
Title
An Improved Chinese Segmentation Algorithm Based on New Dictionary Construction
Author
Niu, Yan ; Li, Lala
Author_Institution
Comput. Coll., Hubei Univ. of Technol., Wuhan, China
Volume
2
fYear
2009
fDate
29-31 Aug. 2009
Firstpage
993
Lastpage
996
Abstract
In this paper, we use the results of word frequency statistics to design a new dictionary structure, and we propose an improved forward maximum matching (FMM) algorithm based on an analysis of the principles and characteristics of the traditional FMM algorithm. Time complexity analysis and experimental comparison show that the improved FMM algorithm further improves the efficiency of Chinese word segmentation.
Keywords
dictionaries; natural language processing; statistical analysis; Chinese segmentation algorithm; dictionary construction; forward maximum matching algorithm; word frequency statistics; Algorithm design and analysis; Design engineering; Dictionaries; Educational institutions; Frequency; Handicapped aids; Information processing; Natural languages; Statistical analysis; Vocabulary; Chinese Segmentation; Chinese word frequency; FMM algorithm; segmentation dictionary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location
Vancouver, BC
Print_ISBN
978-1-4244-5334-4
Electronic_ISBN
978-0-7695-3823-5
Type
conf
DOI
10.1109/CSE.2009.269
Filename
5283472