DocumentCode
506843
Title
Mining Representative Subspace Clusters in High-dimensional Data
Author
Chen, Guanhua ; Ma, Xiuli ; Yang, Dongqing ; Tang, Shiwei ; Shuai, Meng
Author_Institution
Sch. of EECS, Peking Univ., Beijing, China
Volume
1
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
490
Lastpage
494
Abstract
A major challenge in subspace clustering is that subspace clustering may generate an explosive number of clusters with high computational complexity, which severely restricts the usage of subspace clustering. The problem gets even worse with the increase of the data´s dimensionality. In this paper, we propose to mine the representative subspace clusters in high-dimensional data to alleviate the problem. Typically, subspace clusters can be clustered further into groups, and several representative clusters can be generated from each group. Unfortunately, when the size of the set of representative clusters is specified, the problem of finding the optimal set is NP-hard. To solve this problem efficiently, we present an approximate method PCoC. The greatest advantage of our method is that we only need a subset of subspace clusters as the input. Our performance study shows the effectiveness and efficiency of the method.
Keywords
computational complexity; data mining; pattern clustering; NP-hard problem; approximate method PCoC; computational complexity; high dimensional data; partition based clustering on subspace cluster; representative subspace clusters mining; subspace clustering; Assembly; Clustering algorithms; Computational complexity; Data mining; Educational technology; Explosives; Fuzzy systems; Laboratories; Markov random fields; data mining; high dimensional data; representatives; subspace clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3735-1
Type
conf
DOI
10.1109/FSKD.2009.463
Filename
5358525
Link To Document