Title :
Application of Chinese Word Segmentation Based on Linguistic Environment Analysis in Text Information Filtering System
Author :
Yi, Zhi-an ; Lv, Jia
Author_Institution :
Coll. of Comput. & Inf. Technol., Daqing Pet. Inst., Daqing
Abstract :
This paper provides Chinese word segmentation based on language analysis problem in text information filtering system. The improved Chinese word segmentation is made of a bigram segmentation and a segmentation correction, new words recognition and disambiguation through the bigram segmentation, check the accuracy of segmentation results using the segmentation correction from the perspective of syntax. It has been proved by experiments that the segmentation not only strengthen the system´s language analysis ability, but also improve the accuracy of text information filtering system when the improved Chinese word segmentation was applied to the text analysis module.
Keywords :
information filtering; natural language processing; text analysis; word processing; Chinese word segmentation; bigram segmentation; language analysis problem; linguistic environment analysis; segmentation correction; syntax; text information filtering system; words recognition; Computer applications; Educational institutions; Information analysis; Information filtering; Information science; Information technology; Internet; Natural languages; Petroleum; Text analysis; Chinese word segmentation; Information filtration; disambiguation; segmentation correction;
Conference_Titel :
Electronic Computer Technology, 2009 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-0-7695-3559-3
DOI :
10.1109/ICECT.2009.89