DocumentCode :
1595180
Title :
Isotropic PCA and Affine-Invariant Clustering
Author :
Brubaker, Charles S. ; Vempala, Santosh S.
Author_Institution :
Georgia Inst. of Technol., Atlanta, GA
fYear :
2008
Firstpage :
551
Lastpage :
560
Abstract :
We present an extension of principal component analysis (PCA) and a new algorithm for clustering points in Rn based on it. The key property of the algorithm is that it is affine-invariant. When the input is a sample from a mixture of two arbitrary Gaussians, the algorithm correctly classifies the sample assuming only that the two components are separable by a hyperplane, i.e., there exists a halfspace that contains most of one Gaussian and almost none of the other in probability mass. This is nearly the best possible, improving known results substantially. For k>2 components, the algorithm requires only that there be some (k-1)-dimensional subspace in which the ``overlap´´ in every direction is small. Our main tools are isotropic transformation, spectral projection and a simple reweighting technique. We call this combination isotropic PCA.
Keywords :
Gaussian processes; affine transforms; pattern clustering; principal component analysis; probability; affine-invariant clustering; arbitrary Gaussians; isotropic PCA; isotropic transformation; principal component analysis; probability mass; reweighting technique; spectral projection; Clustering algorithms; Computer science; Covariance matrix; Gaussian distribution; Gaussian processes; Labeling; Pattern recognition; Polynomials; Principal component analysis; clustering; mixture models; principal components analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
Conference_Location :
Philadelphia, PA
ISSN :
0272-5428
Print_ISBN :
978-0-7695-3436-7
Type :
conf
DOI :
10.1109/FOCS.2008.48
Filename :
4690988
Link To Document :
بازگشت