DocumentCode :
2543261
Title :
txtKnot — Text clustering based concept hierarchy to generalize from different text sources
Author :
Jayasinghe, Diluk ; Hettiarachchi, Salinda ; Abeywickrama, Sandu ; Ketteepearachchi, Charith ; Alahakoon, Damminda ; Matharage, Sumith ; Gunasinghe, Upuli
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
fYear :
2010
fDate :
17-19 Dec. 2010
Firstpage :
239
Lastpage :
243
Abstract :
Living in the modern technology dependent world, we heavily rely on electronically stored data and information, to come up with sound and timely decisions. Considering the entire information technology world, there exists an unimaginable volume of data which contains a lot of information which is relevant to various kinds of fields. But the problem emerges when we are interested to find out about a particular subject. This is due to its scattered nature of relevant and non-relevant data. Therefore it is fair to say that there exists a critical need for a system which could create an ordered structure that provides a way of modeling the underlying relationships of data elements which will ultimately result in a much easier process of decision making. txtKnot is all about solving the above problem by generating a meaningful hierarchy of concepts from a set of unsorted text documents, thus enabling the visualization of relationships that exist within the set of documents. It consists of four main components namely, Data Extraction Module, Data Pre-processor Module, Text Clustering Module and Concept Hierarchy Generation Module. These four components are integrated together in order to fulfill the main objective of providing an easy to use method of organizing, visualizing, searching and filtering of the huge amount of electronically available unsorted textual data.
Keywords :
data mining; pattern clustering; text analysis; concept hierarchy generation module; data extraction module; data mining; data pre-processor module; electronically stored data; information technology; relationship visualization; text clustering module; txtKnot; Algorithm design and analysis; Artificial neural networks; Clustering algorithms; Data mining; Databases; Knowledge based systems; Semantics; Text clustering; growing self-organizing maps; hierarchical systems; knowledge discovery; neural networks; semantic relation graph; unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Automation for Sustainability (ICIAFs), 2010 5th International Conference on
Conference_Location :
Colombo
Print_ISBN :
978-1-4244-8549-9
Type :
conf
DOI :
10.1109/ICIAFS.2010.5715666
Filename :
5715666
Link To Document :
بازگشت