DocumentCode :
423514
Title :
A time-based self-organising model for document clustering
Author :
Hung, Chihli ; Wermter, Stefan
Author_Institution :
De Lin Inst. of Technol., Taiwan
Volume :
1
fYear :
2004
fDate :
25-29 July 2004
Lastpage :
22
Abstract :
Most current approaches for document clustering do not consider the non-stationary feature of real world document collection. In this paper, in a non-stationary environment, we propose a new self-organising model, namely the dynamic adaptive self-organising hybrid (DASH) model. The DASH model runs continuously since the new document set is formed consecutively for training while the old document set is still at the training stage. Knowledge learned from the old data set is adjusted to reflect the new data set and therefore document clusters are up-to-date. We test the performance of our model using the Reuters-RCV1 news corpus and obtain promising results based on the criteria of classification accuracy and average quantization error.
Keywords :
pattern clustering; self-organising feature maps; average quantization error; classification accuracy criteria; document clustering; dynamic adaptive self-organising hybrid model; time-based self-organising model; Artificial neural networks; Hybrid intelligent systems; Knowledge transfer; Learning systems; Prototypes; Quantization; Space technology; Stress; Technological innovation; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1379861
Filename :
1379861
Link To Document :
بازگشت