DocumentCode
256687
Title
Chinese Keyword Extraction Using Semantically Weighted Network
Author
Qian Chen ; Zengru Jiang ; Jinqiang Bian
Author_Institution
Sch. of Autom., Beijing Inst. of Technol., Beijing, China
Volume
2
fYear
2014
fDate
26-27 Aug. 2014
Firstpage
83
Lastpage
86
Abstract
The complex network theory is widely used in the field of keyword extraction. Through analyzing the insufficient of keyword extraction algorithms using traditional complex network, this paper proposes a new method to extract Chinese keyword based on semantically weighted network. On the basis of K-nearest neighbor coupling network, we build semantically weighted network according to the co-occurrence frequency and semantic similarity of the words in the text. We calculate the betweenness value, clustering coefficient variation and shortest path variation of the word node in the network to obtain the comprehensive eigenvalue of each word. According to the size of comprehensive eigenvalue, we extract text keyword. The experimental results show that the keywords extracted by this method can reflect the theme of the text better, and the accuracy has been significantly improved.
Keywords
complex networks; eigenvalues and eigenfunctions; natural language processing; network theory (graphs); pattern clustering; text analysis; word processing; Chinese text keyword extraction; K-nearest neighbor coupling network; complex network theory; keyword cooccurrence frequency; keyword eigenvalue; keyword semantic similarity; semantically weighted network; word node betweenness value; word node clustering coefficient variation; word node shortest path variation; Accuracy; Artificial intelligence; Complex networks; Computers; Dictionaries; Eigenvalues and eigenfunctions; Semantics; betweenness; comprehensive eigenvalue; keyword extraction; semantically weighted network;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2014 Sixth International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4799-4956-4
Type
conf
DOI
10.1109/IHMSC.2014.123
Filename
6911454
Link To Document