Title :
An improved method of keywords extraction based on short technology text
Author :
Wang, Jun ; Li, Lei ; Ren, Fuji
Author_Institution :
Sch. of Comput., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Keywords are the critical resources of information management and retrieval, automatic text classification and clustering. The keywords extraction plays an important role in the process of constructing structured text. Current algorithms of keywords extraction have matured in some ways. However the errors of word segmentation which caused by unknown words have been affected the performance of Chinese keywords extraction, particularly in the field of technological text. In order to solve the problem, this paper proposes an improved method of keywords extraction based on the relationship among words. Experiments show that the proposed method can effectively correct the errors caused by segmentation and improve the performance of keywords extraction, and it can also extend to other areas.
Keywords :
text analysis; word processing; Chinese keywords extraction; keywords extraction method; short technology text; structured text construction; word relationship; word segmentation; Algorithm design and analysis; Cows; Employment; Semantics; Steel; Tagging; Training; Short technology text; improved method; keywords extraction; unknown words;
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
DOI :
10.1109/NLPKE.2010.5587797