Title :
Low Dimensional Localized Clustering (LDLC)
Author :
Tadavani, P.K. ; Ghodsi, Ali
Author_Institution :
David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
Abstract :
In the space of high-dimensional data, it is generally reasonable to assume that the data points are on (or close to) one or more submanifolds. Each of these submanifolds can be modeled by a number of linear subspaces. This is in fact the main intuition behind a majority of subspace clustering algorithms. In many cases, however, the subspaces computed by these algorithms consist of disconnected subsets of the underlying submanifolds and therefore, do not form localized and compact clusters. To address this problem, we propose "Low Dimensional Localized Clustering (LDLC)", a new method for subspace clustering. Unlike existing methods, LDLC respects the topology of the underling submanifolds and assigns the data points to localized clusters such that the total reconstruction error is minimized. This is a valuable property in many tasks, such as semi-supervised classification, data visualization and local dimensionality reduction. We establish connections between LDLC, K-Means, and VQPCA from different perspectives, and validate our method through various experiments on synthetic and real data sets.
Keywords :
pattern clustering; topology; VQPCA; data visualization; high-dimensional data; k-means; linear subspace; local dimensionality reduction; low dimensional localized clustering; semisupervised classification; submanifold; subspace clustering algorithm; topology; Clustering algorithms; Data visualization; Image reconstruction; Linear programming; Manifolds; Measurement uncertainty; Partitioning algorithms; Clustering; Dimensionality Reduction; Manifold;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.134