• DocumentCode
    1698038
  • Title

    Learning Convex Concepts from Gaussian Distributions with PCA

  • Author

    Vempala, Santosh S.

  • Author_Institution
    Sch. of Comput. Sci., Georgia Tech, Atlanta, GA, USA
  • fYear
    2010
  • Firstpage
    124
  • Lastpage
    130
  • Abstract
    We present a new algorithm for learning a convex set in n-dimensional space given labeled examples drawn from any Gaussian distribution. The complexity of the algorithm is bounded by a fixed polynomial in n times a function of k and ϵ where k is the dimension of the normal subspace (the span of normal vectors to supporting hyperplanes of the convex set) and the output is a hypothesis that correctly classifies at least 1 - ϵ of the unknown Gaussian distribution. For the important case when the convex set is the intersection of k halfspaces, the complexity is poly(n, k, 1/ϵ) + n · min k(O(log k/ϵ4)), (k/ϵ)O(k), improving substantially on the state of the art [Vem04], [KOS08] for Gaussian distributions. The key step of the algorithm is a Singular Value Decomposition after applying a normalization. The proof is based on a monotonicity property of Gaussian space under convex restrictions.
  • Keywords
    Gaussian distribution; computational complexity; learning (artificial intelligence); principal component analysis; set theory; singular value decomposition; Gaussian distribution; Gaussian space; PCA; convex restriction; convex set; learning; monotonicity property; polynomial complexity; singular value decomposition; Accuracy; Algorithm design and analysis; Complexity theory; Covariance matrix; Gaussian distribution; Polynomials; Principal component analysis; Gaussians; High-dimensional learning; PCA; convex; polynomial time;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    0272-5428
  • Print_ISBN
    978-1-4244-8525-3
  • Type

    conf

  • DOI
    10.1109/FOCS.2010.19
  • Filename
    5670814