DocumentCode :
3710048
Title :
Probabilistic Polynomials and Hamming Nearest Neighbors
Author :
Josh Alman;Ryan Williams
Author_Institution :
Comput. Sci. Dept. Stanford, Stanford Univ., Stanford, CA, USA
fYear :
2015
Firstpage :
136
Lastpage :
150
Abstract :
We show how to compute any symmetric Boolean function on n variables over any field (as well as ´/ the integers) with a probabilistic polynomial of degree O( √nlog(1/ε)) and error at most ε. The degree dependence on n and ε is optimal, matching a lower bound of Razborov (1987) and Smolensky (1987) for the MAJORITY function. The proof is constructive: a low-degree polynomial can be efficiently sampled from the distribution. This polynomial construction is combined with other algebraic ideas to give the first subquadratic time algorithm for computing a (worst-case) batch of Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let c(n) : ℕ → ℕ. Suppose we are given a database D of n vectors in {0,1}c(n)logn and a collection of n query vectors Q in the same dimension. For all u ∈ Q, we wish to compute a v ∈ D with minimum Hamming distance from u. We solve this problem in n2-1/O(c(n)log2c(n)) randomized time. Hence, the problem is in “truly subquadratic” time for O(logn) dimensions, and in subquadratic time for d = o((log2 n)/(loglogn)2). We apply the algorithm to computing pairs with maximum inner product, closest pair in ℓ1 for vectors with bounded integer entries, and pairs with maximum Jaccard coefficients.
Keywords :
"Polynomials","Probabilistic logic","Hamming distance","Databases","Approximation algorithms","Approximation methods","Boolean functions"
Publisher :
ieee
Conference_Titel :
Foundations of Computer Science (FOCS), 2015 IEEE 56th Annual Symposium on
ISSN :
0272-5428
Type :
conf
DOI :
10.1109/FOCS.2015.18
Filename :
7354392
Link To Document :
بازگشت