Title :
Learning mixtures of product distributions over discrete domains
Author :
Feldman, Jon ; O´Donnell, Ryan ; Servedio, Rocco A.
Author_Institution :
Dept. of IEOR, Columbia Univ., NY, USA
Abstract :
We consider the problem of learning mixtures of product distributions over discrete domains in the distribution learning framework introduced by Kearns et al. (1994). We give a poly(n/ε) time algorithm for learning a mixture of k arbitrary product distributions over the n-dimensional Boolean cube {0, 1}n to accuracy ε, for any constant k. Previous poly(n)-time algorithms could only achieve this for k = 2 product distributions; our result answers an open question stated independently in M. Cryan (1999) and Y. Freund and Y. Mansour (1999). We further give evidence that no polynomial time algorithm can succeed when k is superconstant, by reduction from a notorious open problem in PAC learning. Finally, we generalize our poly(n/ε) time algorithm to learn any mixture of k = O(1) product distributions over {0, 1,... , b }n,for any b = O(1).
Keywords :
Boolean functions; computational complexity; learning (artificial intelligence); probability; PAC learning; distribution learning framework; learning mixtures; n-dimensional Boolean cube; poly(n/ε) time algorithm; product distributions; Artificial intelligence; Computer science; Engineering profession; Entropy; Geology; Learning; Polynomials; Probability distribution;
Conference_Titel :
Foundations of Computer Science, 2005. FOCS 2005. 46th Annual IEEE Symposium on
Print_ISBN :
0-7695-2468-0
DOI :
10.1109/SFCS.2005.46