Title :
Approximate Kernel Clustering
Author :
Khot, Subhash ; Naor, Assaf
Author_Institution :
Courant Inst. of Math. Sci., New York, NY
Abstract :
In the kernel clustering problem we are given a large ntimesn positive semi-definite matrix A=(aij) with Sigmai,j n=1 aij=0 and a small ktimesk positivesemi-definite matrix B=bij. The goal is to find a partition S1,..Sk of {1,...n} which maximizes the quantity Sigmai,j=1 k(Sigma(i,j)isinS i timesS j). We study the computational complexity of this generic clustering problem which originates in the theory of machine learning. We design a constant factor polynomial time approximation algorithm forthis problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the unique games conjecture (UGC). In particular, when B is the 3times3 identity matrix the UGC hardness threshold of this problem is exactly 16pi/27. We present and study a geometricconjecture of independent interest which we show would imply thatthe UGC threshold when B is the ktimesk identity matrix is 8pi/9(1-1/k) for every kges3.
Keywords :
approximation theory; computational complexity; game theory; matrix algebra; pattern clustering; approximate kernel clustering; computational complexity; polynomial time approximation algorithm; semi-definite matrix; unique games conjecture; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Computational complexity; Computer science; Kernel; Machine learning; Machine learning algorithms; Polynomials; User-generated content; Approximation algorithm; clustering; inapproximability;
Conference_Titel :
Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3436-7
DOI :
10.1109/FOCS.2008.33