Title :
Automatic keywords extraction of Chinese document using small world structure
Author :
Mengxiao, Zhu ; Zhi, Cai ; Qingsheng, Cai
Author_Institution :
Dept. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
Small world structure characterized by short characteristic path length and high clustering coefficient is widely observed in many natural and man-made systems. Inspired by the small word structure found in English, we analyzed Chinese documents and construct cooccurrence networks to representing the correlations of words, where nodes are words and the cooccurrences of words in the same sentence make up links. The cooccurrence networks show obvious small world properties. By utilizing the impact of each node´s absence on the characteristic path length of the network, we extract keywords, which can well represent the contents of the document.
Keywords :
natural languages; word processing; Chinese document; automatic keyword extraction; cooccurrence network; word correlation; Biological system modeling; Books; Bridges; Character generation; Computer science; Humans; Mesh generation; Natural languages; Power grids; Social network services;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275946