Title :
Knowledge Discovery from Semi-Structured Data for Conceptual Organization
Author :
Gupta, S. ; Goyal, R. ; Shubham, K. ; Dey, L. ; Malik, A. ; Chaudhury, S. ; Bhattacharya, S.
Author_Institution :
Dept. of Math., Indian Inst. of Technol., New Delhi
Abstract :
Conceptual organization of semi-structured documents can help in effective retrieval from collections of emails, product complaints, video descriptions etc. In this paper, we propose a conceptual organization scheme for grouping and categorizing semi-structured text data using natural language processing techniques. We propose a knowledge-discovery mechanism that extracts noun phrases from documents and arranges them into concept maps based on their co-occurrence. The emerging concept maps can be used for automatic grouping and conceptual categorization of documents. Further, phrase structure grammar is employed to extract relationships among these entities from documents and index the document collection with these relations
Keywords :
data mining; grammars; natural language processing; text analysis; vocabulary; automatic grouping; concept maps; conceptual categorization; conceptual organization; knowledge discovery; natural language processing; phrase structure grammar; semi-structured data; semi-structured documents; Clustering algorithms; Content based retrieval; Data mining; Extraterrestrial measurements; Indexing; Intelligent agent; Machine learning algorithms; Mathematics; Natural language processing; Ontologies;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology Workshops, 2006. WI-IAT 2006 Workshops. 2006 IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2749-3
DOI :
10.1109/WI-IATW.2006.86