DocumentCode :
2127353
Title :
Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction
Author :
Qingguo, Zhang ; Chengzhi, Zhang
Author_Institution :
Tongfang Knowledge Network Technol. Co., Ltd., Beijing
fYear :
2008
fDate :
21-22 Dec. 2008
Firstpage :
689
Lastpage :
692
Abstract :
In this paper, a method of automatic Chinese keyword extraction based on KNN is proposed. Firstly, it preprocesses the document by vector space model. Secondly, it constructs a set of candidate keywords based on KNN method and the labeled dataset. Finally, it post-processes on candidate keywords by the character of keyword to meet readers´ requirements Experimental results show the method proposed can not only improve the precision and recall of keyword extraction, but also extract implicit subject efficiently.
Keywords :
information retrieval; natural language processing; pattern classification; KNN method; automatic Chinese keyword extraction; implicit subject extraction; k-nearest neighbor; vector space model; Data mining; Data preprocessing; Euclidean distance; Frequency; Indexing; Information management; Knowledge acquisition; Learning systems; Statistics; Training data; Implicit Subject Extraction; KNN; Keyword Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge Acquisition and Modeling, 2008. KAM '08. International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3488-6
Type :
conf
DOI :
10.1109/KAM.2008.87
Filename :
4732916
Link To Document :
بازگشت