Title :
Incremental Subspace Clustering over Multiple Data Streams
Author :
Zhang, Qi ; Liu, Jinze ; Wang, Wei
Author_Institution :
Univ. of North Carolina, Charlotte
Abstract :
Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub- space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub- space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the delta-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. delta-CC´-Cluster s are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update the pDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.
Keywords :
data analysis; pattern clustering; delta-CC-Clusters; effective pruning technique; incremental subspace clustering; multiple data streams; sliding windows; static data sets; Algorithm design and analysis; Boolean functions; Clustering algorithms; Clustering methods; Data analysis; Data mining; Data structures; Monitoring; Subspace constraints; Telecommunication traffic;
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3018-5
DOI :
10.1109/ICDM.2007.100