DocumentCode :
2735579
Title :
The prefix and suffix query of Chinese word segmentation algorithm for maximum matching
Author :
Ye, Junmin ; Li, Songsong ; Hao, Guangquan ; Li, Shizi ; Yang, Yan ; Jin, Cong
Author_Institution :
Dept. of Comput. Sci., HuaZhong Normal Univ., Wuhan, China
fYear :
2011
fDate :
21-23 Oct. 2011
Firstpage :
74
Lastpage :
77
Abstract :
Chinese word segmentation is a key technology for automatic summarization. Whether the segmentation is successful and has no ambiguity or not will directly affect sentence weight calculation. In the segmentation process, the structure of word segmentation dictionary is particularly important. A rational structure of a word segmentation dictionary can improve the segmentation process of the dictionary query speed, and, thus, improves the efficiency of Chinese word segmentation. This paper is based on the dictionary of Hash structure. In this paper, the authors adopt the prefix and suffix query approach to build word segmentation dictionary, combined with an improved Chinese word segmentation algorithm for maximum matching to improve the segmentation efficiency and accuracy.
Keywords :
dictionaries; natural language processing; query processing; Chinese word segmentation algorithm; automatic summarization; dictionary query speed; hash structure dictionary; maximum matching; prefix query; sentence weight calculation; suffix query; word segmentation dictionary structure; Accuracy; Algorithm design and analysis; Complexity theory; Computers; Dictionaries; Educational institutions; Indexes; Chinese word segmentation; maximum matching algorithm; prefix and suffix query; word segmentation dictionary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Analysis and Signal Processing (IASP), 2011 International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-1-61284-879-2
Type :
conf
DOI :
10.1109/IASP.2011.6109001
Filename :
6109001
Link To Document :
بازگشت