DocumentCode :
1701346
Title :
Agnostically Learning under Permutation Invariant Distributions
Author :
Wimmer, Karl
Author_Institution :
Math. & Comput. Sci. Dept., Duquesne Univ., Pittsburgh, PA, USA
fYear :
2010
Firstpage :
113
Lastpage :
122
Abstract :
We generalize algorithms from computational learning theory that are successful under the uniform distribution on the Boolean hypercube {0,1}^n to algorithms successful on permutation invariant distributions. A permutation invariant distribution is a distribution where the probability mass remains constant upon permutations in the instances. While the tools in our generalization mimic those used for the Boolean hypercube, the fact that permutation invariant distributions are not product distributions presents a significant obstacle. Under the uniform distribution, halfspaces can be agnostically learned in polynomial time for constant ε. The main tools used are a theorem of Peres [Per04] bounding the noise sensitivity of a halfspace, a result of [KOS04] that this theorem implies Fourier concentration, and a modification of the Low-Degree algorithm of Linial, Mansour, Nisan [LMN93] made by Kalai et al. [KKMS08]. These results are extended to arbitrary product distributions in [BOW08]. We prove analogous results for permutation invariant distributions; more generally, we work in the domain of the symmetric group. We define noise sensitivity in this setting, and show that noise sensitivity has a nice combinatorial interpretation in terms of Young tableaux. The main technical innovations involve techniques from the representation theory of the symmetric group, especially the combinatorics of Young tableaux. We show that low noise sensitivity implies concentration on "simple" components of the Fourier spectrum, and that this fact will allow us to agnostically learn halfspaces under permutation invariant distributions to constant accuracy in roughly the same time as in the uniform distribution over the Boolean hypercube case.
Keywords :
Boolean algebra; Fourier analysis; combinatorial mathematics; computational complexity; learning (artificial intelligence); Boolean hypercube; Fourier concentration; Fourier spectrum; Young tableaux; agnostically learning; arbitrary product distributions; combinatorial interpretation; combinatorics; computational learning theory; low noise sensitivity; low-degree algorithm; permutation invariant distributions; polynomial time; probability mass; representation theory; symmetric group; theorem of Peres; uniform distribution; Hypercubes; Loss measurement; Noise; Polynomials; Sensitivity; Support vector machines; Tin; Boolean functions; Fourier analysis; agnostic learning; representation theory; symmetric group;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on
Conference_Location :
Las Vegas, NV
ISSN :
0272-5428
Print_ISBN :
978-1-4244-8525-3
Type :
conf
DOI :
10.1109/FOCS.2010.17
Filename :
5670941