DocumentCode
1595180
Title
Isotropic PCA and Affine-Invariant Clustering
Author
Brubaker, Charles S. ; Vempala, Santosh S.
Author_Institution
Georgia Inst. of Technol., Atlanta, GA
fYear
2008
Firstpage
551
Lastpage
560
Abstract
We present an extension of principal component analysis (PCA) and a new algorithm for clustering points in Rn based on it. The key property of the algorithm is that it is affine-invariant. When the input is a sample from a mixture of two arbitrary Gaussians, the algorithm correctly classifies the sample assuming only that the two components are separable by a hyperplane, i.e., there exists a halfspace that contains most of one Gaussian and almost none of the other in probability mass. This is nearly the best possible, improving known results substantially. For k>2 components, the algorithm requires only that there be some (k-1)-dimensional subspace in which the ``overlap´´ in every direction is small. Our main tools are isotropic transformation, spectral projection and a simple reweighting technique. We call this combination isotropic PCA.
Keywords
Gaussian processes; affine transforms; pattern clustering; principal component analysis; probability; affine-invariant clustering; arbitrary Gaussians; isotropic PCA; isotropic transformation; principal component analysis; probability mass; reweighting technique; spectral projection; Clustering algorithms; Computer science; Covariance matrix; Gaussian distribution; Gaussian processes; Labeling; Pattern recognition; Polynomials; Principal component analysis; clustering; mixture models; principal components analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
Conference_Location
Philadelphia, PA
ISSN
0272-5428
Print_ISBN
978-0-7695-3436-7
Type
conf
DOI
10.1109/FOCS.2008.48
Filename
4690988
Link To Document