Title :
Keywords Extracting as Text Chance Discovery
Author :
Zhang, Zhenya ; Cheng, Hongmei
Author_Institution :
Univ. of Sci. & Technol., Hefei
Abstract :
Keyword auto-extracting is focused by researchers on information retrieval, data mining, chance discovery and others application. In this paper, new algorithm, CCG(Cognition & Concept Graph, for text chance discovery is presented based on cognition with data depth as measurement. When the keywords in a document are treated as chances in the document, those keywords can be extracted by CGC automatically. In CGC, concepts of a document are represented as maximum connected sub graphs of the basic graph for the document and the cognition of reader/author on a term is weighted with data depth. The correlation for word and concept is defined and the formula for the correlation calculating is given. Experimental results show that keywords extracted by CCG can describe the document and author/reader´s cognition much better than keywords extracted by others technologies such as frequency accumulating or Key Graph.
Keywords :
information retrieval; text analysis; cognition & concept graph; data mining; information retrieval; keywords extraction; maximum connected subgraphs; text chance discovery; Cognition; Data engineering; Data mining; Frequency; Industrial electronics; Information retrieval; Laboratories; Man machine systems; Multimedia computing; Visualization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.373