DocumentCode :
2348028
Title :
An improved method of keywords extraction based on short technology text
Author :
Wang, Jun ; Li, Lei ; Ren, Fuji
Author_Institution :
Sch. of Comput., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
6
Abstract :
Keywords are the critical resources of information management and retrieval, automatic text classification and clustering. The keywords extraction plays an important role in the process of constructing structured text. Current algorithms of keywords extraction have matured in some ways. However the errors of word segmentation which caused by unknown words have been affected the performance of Chinese keywords extraction, particularly in the field of technological text. In order to solve the problem, this paper proposes an improved method of keywords extraction based on the relationship among words. Experiments show that the proposed method can effectively correct the errors caused by segmentation and improve the performance of keywords extraction, and it can also extend to other areas.
Keywords :
text analysis; word processing; Chinese keywords extraction; keywords extraction method; short technology text; structured text construction; word relationship; word segmentation; Algorithm design and analysis; Cows; Employment; Semantics; Steel; Tagging; Training; Short technology text; improved method; keywords extraction; unknown words;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587797
Filename :
5587797
Link To Document :
بازگشت