DocumentCode :
2771269
Title :
Self-Adaptive Anytime Stream Clustering
Author :
Kranen, Philipp ; Assent, Ira ; Baldauf, Corinna ; Seidl, Thomas
Author_Institution :
RWTH Aachen Univ., Aachen, Germany
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
249
Lastpage :
258
Abstract :
Clustering streaming data requires algorithms which are capable of updating clustering results for the incoming data. As data is constantly arriving, time for processing is limited. Clustering has to be performed in a single pass over the incoming data and within the possibly varying inter-arrival times of the stream. Likewise, memory is limited, making it impossible to store all data. For clustering, we are faced with the challenge of maintaining a current result that can be presented to the user at any given time. In this work, we propose a parameter free algorithm that automatically adapts to the speed of the data stream. It makes best use of the time available under the current constraints to provide a clustering of the objects seen up to that point. Our approach incorporates the age of the objects to reflect the greater importance of more recent data. Moreover, we are capable of detecting concept drift, novelty and outliers in the stream. For efficient and effective handling, we introduce the ClusTree, a compact and self-adaptive index structure for maintaining stream summaries. Our experiments show that our approach is capable of handling a multitude of different stream characteristics for accurate and scalable anytime stream clustering.
Keywords :
pattern clustering; tree data structures; ClusTree; parameter free algorithm; self-adaptive anytime stream clustering; self-adaptive index structure; Adaptive algorithm; Algorithm design and analysis; Clustering algorithms; Consumer behavior; Data analysis; Data mining; Memory management; Partitioning algorithms; Sensor phenomena and characterization; Time factors; anytime algorithms; self-adaptive algorithms; stream clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.47
Filename :
5360250
Link To Document :
بازگشت