• DocumentCode
    1595180
  • Title

    Isotropic PCA and Affine-Invariant Clustering

  • Author

    Brubaker, Charles S. ; Vempala, Santosh S.

  • Author_Institution
    Georgia Inst. of Technol., Atlanta, GA
  • fYear
    2008
  • Firstpage
    551
  • Lastpage
    560
  • Abstract
    We present an extension of principal component analysis (PCA) and a new algorithm for clustering points in Rn based on it. The key property of the algorithm is that it is affine-invariant. When the input is a sample from a mixture of two arbitrary Gaussians, the algorithm correctly classifies the sample assuming only that the two components are separable by a hyperplane, i.e., there exists a halfspace that contains most of one Gaussian and almost none of the other in probability mass. This is nearly the best possible, improving known results substantially. For k>2 components, the algorithm requires only that there be some (k-1)-dimensional subspace in which the ``overlap´´ in every direction is small. Our main tools are isotropic transformation, spectral projection and a simple reweighting technique. We call this combination isotropic PCA.
  • Keywords
    Gaussian processes; affine transforms; pattern clustering; principal component analysis; probability; affine-invariant clustering; arbitrary Gaussians; isotropic PCA; isotropic transformation; principal component analysis; probability mass; reweighting technique; spectral projection; Clustering algorithms; Computer science; Covariance matrix; Gaussian distribution; Gaussian processes; Labeling; Pattern recognition; Polynomials; Principal component analysis; clustering; mixture models; principal components analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
  • Conference_Location
    Philadelphia, PA
  • ISSN
    0272-5428
  • Print_ISBN
    978-0-7695-3436-7
  • Type

    conf

  • DOI
    10.1109/FOCS.2008.48
  • Filename
    4690988