DocumentCode
2914077
Title
Hardness of Learning Halfspaces with Noise
Author
Guruswami, Venkatesan ; Raghavendra, Prasad
Author_Institution
Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA
fYear
2006
fDate
Oct. 2006
Firstpage
543
Lastpage
552
Abstract
Learning an unknown halfspace (also called a perceptron) from, labeled examples is one of the classic problems in machine learning. In the noise-free case, when a half-space consistent with all the training examples exists, the problem can be solved in polynomial time using linear programming. However, under the promise that a halfspace consistent with a fraction (1 - epsiv) of the examples exists (for some small constant epsiv > 0), it was not known how to efficiently find a halfspace that is correct on even 51% of the examples. Nor was a hardness result that ruled out getting agreement on more than 99.9% of the examples known. In this work, we close this gap in our understanding, and prove that even a tiny amount of worst-case noise makes the problem of learning halfspaces intractable in a strong sense. Specifically, for arbitrary epsiv,delta > 0, we prove that given a set of examples-label pairs from the hypercube a fraction (1 - epsiv) of which can be explained by a halfspace, it is NP-hard to find a halfspace that correctly labels a fraction (frac12 + delta) of the examples. The hardness result is tight since it is trivial to get agreement on frac12 the examples. In learning theory parlance, we prove that weak proper agnostic learning of halfspaces is hard. This settles a question that was raised by Blum et. al in their work on learning halfspaces in the presence of random classification noise (A. Blum et. al, 1996), and in some more recent works as well. Along the way, we also obtain a strong hardness for another basic computational problem: solving a linear system over the rationals
Keywords
computational complexity; learning (artificial intelligence); linear programming; linear systems; perceptrons; NP-hardness; agnostic learning; halfspace learning; linear programming; linear system; machine learning; perceptron; polynomial time; Computer science; Equations; Hypercubes; Linear programming; Linear systems; Machine learning; Occupational stress; Polynomials; Testing; Veins;
fLanguage
English
Publisher
ieee
Conference_Titel
Foundations of Computer Science, 2006. FOCS '06. 47th Annual IEEE Symposium on
Conference_Location
Berkeley, CA
ISSN
0272-5428
Print_ISBN
0-7695-2720-5
Type
conf
DOI
10.1109/FOCS.2006.33
Filename
4031389
Link To Document