DocumentCode :
2200525
Title :
Topic clustering and topic evolution based on temporal parameters
Author :
Jayashri, M. ; Chitra, P.
Author_Institution :
Dept. of Comput. Sci. & Eng., Thiagarajar Coll. of Eng., Madurai, India
fYear :
2012
fDate :
19-21 April 2012
Firstpage :
559
Lastpage :
564
Abstract :
Historical documents contains extreme amount of information about past events, often in unstructured form. Once dates and document names are identified that can differ by genre, we examine collections to detect events. Temporal text mining (TTM) is used to ascertain the temporal patterns in text information collected over time. Trend analysis from the stream of text documents generally uses an approach based on topic detection and tracking (TDT). The task of topic detection is used to detect topics that are previously unknown to the system. Tracking generates the evolution of each topic over the period of arrival time. In this work the TDT task has been formulated as a clustering problem in a class of self-organizing neural networks, called the Adaptive Resonance Theory (ART) networks. We also propose that our algorithm has been able to detect hot topics automatically and track them with good accuracy. From our experimental studies we prove this by comparing the effectiveness of the different validity indices of simple k-means clustering method. We also show the benchmarking results of different kinds of datasets.
Keywords :
data mining; pattern clustering; self-organising feature maps; text analysis; adaptive resonance theory network; clustering problem; event detection; historical document; k-means clustering method; self-organizing neural network; temporal parameter; temporal text mining; text information; topic clustering; topic detection; topic evolution; topic tracking; trend analysis; Accuracy; Algorithm design and analysis; Clustering algorithms; Prototypes; Subspace constraints; Support vector machine classification; Vectors; temporal text mining; topic clustering; topic detection; topic tracking; trend analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recent Trends In Information Technology (ICRTIT), 2012 International Conference on
Conference_Location :
Chennai, Tamil Nadu
Print_ISBN :
978-1-4673-1599-9
Type :
conf
DOI :
10.1109/ICRTIT.2012.6206816
Filename :
6206816
Link To Document :
بازگشت