DocumentCode :
3270150
Title :
Research of intelligent word segmentation and information retrieval
Author :
Li, Xiaofei ; Xie, Xusheng
Author_Institution :
Inst. of Comput. Inf. & Eng., Jiangxi Normal Univ., Nanchang, China
Volume :
5
fYear :
2010
fDate :
22-24 June 2010
Abstract :
Chinese information retrieval process is somewhat different from the English information retrieval process. In consideration of the existing problems and difficulties of Chinese language information processing, Hibernate search was introduced to exploit information retrieval engine in this paper. A Chinese language analyzer based on the word stock was adopted to process Chinese language information, therefore this analyzer could advance with the times by updating the word stock at any time. However, ambiguity errors caused by the Chinese language analyzer always interfered with the degree of accuracy of the result. During the time of information retrieval, a secondary word segmentation algorithm was used in order to improve Chinese language information retrieval precision. The result list given in this paper had shown that the intelligent Chinese segmentation algorithm had improved the system performance well.
Keywords :
information retrieval; word processing; Chinese information retrieval; english information retrieval; information retrieval engine; intelligent word segmentation; Indexes; Information analysis; Information processing; Information retrieval; Natural languages; Hibernate search; Information retrieval; Word Segmentation; Word sense disambiguation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
Type :
conf
DOI :
10.1109/ICETC.2010.5529961
Filename :
5529961
Link To Document :
بازگشت