DocumentCode
3026597
Title
Density-Based Data Streams Subspace Clustering over Weighted Sliding Windows
Author
Ren, Jiadong ; Cao, Shiyuan ; Hu, Changzhen
Author_Institution
Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
fYear
2010
fDate
23-24 Oct. 2010
Firstpage
212
Lastpage
216
Abstract
Most real-world data sets are characterized by a high dimensinal, inherely sparse data space. In this paper, we present a novel density-based approach to the subspace clustering problem. A new framework for data stream mining is introduced, called the weighted sliding window. In the online component, the structure of Exponential Histogram of Cluster Feature(EHCF) is improved to maintain the micro-clusters. The concepts of potential core-micro-cluster and outlier micro-cluster are applied to distinguish the potential clusters and outliers. A novel pruning strategy is proposed to decrease the number of micro-clusters. In the offline component, the final clusters are generated by SUBCLU algorithm. Our performance study demonstrates the effectiveness and efficiency of our algorithm.
Keywords
data mining; pattern clustering; SUBCLU algorithm; core-microclusters; data stream mining; density-based data streams subspace clustering; exponential histogram of cluster feature; outlier microcluster; pruning strategy; sparse data space; weighted sliding windows; Algorithm design and analysis; Clustering algorithms; Data mining; Data models; Finite element methods; Histograms; Knowledge engineering; data stream; density-based; subspace clustering; weighted sliding windows;
fLanguage
English
Publisher
ieee
Conference_Titel
Cryptography and Network Security, Data Mining and Knowledge Discovery, E-Commerce & Its Applications and Embedded Systems (CDEE), 2010 First ACIS International Symposium on
Conference_Location
Qinhuangdao
Print_ISBN
978-1-4244-9595-5
Type
conf
DOI
10.1109/CDEE.2010.48
Filename
5759375
Link To Document