Title :
Model-Based Method for Projective Clustering
Author :
Chen, Lifei ; Jiang, Qingshan ; Wang, Shengrui
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou, China
fDate :
7/1/2012 12:00:00 AM
Abstract :
Clustering high-dimensional data is a major challenge due to the curse of dimensionality. To solve this problem, projective clustering has been defined as an extension to traditional clustering that attempts to find projected clusters in subsets of the dimensions of a data space. In this paper, a probability model is first proposed to describe projected clusters in high-dimensional data space. Then, we present a model-based algorithm for fuzzy projective clustering that discovers clusters with overlapping boundaries in various projected subspaces. The suitability of the proposal is demonstrated in an empirical study done with synthetic data set and some widely used real-world data set.
Keywords :
fuzzy set theory; pattern clustering; probability; cluster discovery; data dimensionality; fuzzy projective clustering; high-dimensional data clustering; high-dimensional data space; model-based method; overlapping boundaries; probability model; projected subspaces; real-world data set; synthetic data set; Analytical models; Clustering algorithms; Data mining; Data models; Electronic mail; Gene expression; Proposals; Clustering; high dimensions; probability model.; projective clustering;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2010.256