Title :
Mining semantic networks for knowledge discovery
Author :
Rajaraman, K. ; Tan, Ah-Hwee
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
We address the problem of mining a class of semantic networks, called concept frame graphs (CFG´s), for knowledge discovery from text. This new representation is motivated by the need to capture richer text content so that nontrivial mining tasks can be performed. We first define the CFG representation and then describe a rule-based algorithm for constructing a CFG from text documents. Treating the CFG as a networked knowledge base, we propose new methods for text mining. On a specific task of discovering the top companies in an area, we observe that our approach leads to simpler content mining algorithms, once the CFG has been constructed. Moreover, exploiting the network structure of CFG results in significant improvements in precision and recall.
Keywords :
data mining; frame based representation; knowledge based systems; semantic networks; text analysis; concept frame graphs; content mining algorithms; knowledge discovery; networked knowledge base; rule-based algorithm; semantic networks mining; text documents; text mining; Algorithm design and analysis; Computer hacking; Computer networks; Computer worms; Data mining; Knowledge based systems; Knowledge engineering; Organizing; Text mining; User interfaces;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250995