مرکز منطقه ای اطلاع رساني علوم و فناوري - An Automatic Inequality Prover and Instance Optimal Identity Testing

DocumentCode :

180729

Title :

An Automatic Inequality Prover and Instance Optimal Identity Testing

Author :

Valiant, Gregory ; Valiant, Paul

Author_Institution :

Stanford Univ., Stanford, CA, USA

fYear :

2014

fDate :

18-21 Oct. 2014

Firstpage :

Lastpage :

Abstract :

We consider the problem of verifying the identity of a distribution: Given the description of a distribution over a discrete support p = (p₁, p₂, ... , p_n), how many samples (independent draws) must one obtain from an unknown distribution, q, to distinguish, with high probability, the case that p = q from the case that the total variation distance (L₁ distance) ||p - q||1≥ ϵ? We resolve this question, up to constant factors, on an instance by instance basis: there exist universal constants c, c´ and a function f(p, ϵ) on distributions and error parameters, such that our tester distinguishes p = q from ||p-q||1≥ ϵ using f(p, ϵ) samples with success probability > 2/3, but no tester can distinguish p = q from ||p - q||1≥ c · ϵ when given c´ · f(p, ϵ) samples. The function f(p, ϵ) is upperbounded by a multiple of ||p||2/3/ϵ², but is more complicated, and is significantly smaller in some cases when p has many small domain elements, or a single large one. This result significantly generalizes and tightens previous results: since distributions of support at most n have L_2/3 norm bounded by √n, this result immediately shows that for such distributions, O(√n/ϵ²) samples suffice, tightening the previous bound of O(√npolylog/n⁴) for this class of distributions, and matching the (tight) known results for the case that p is the uniform distribution over support n. The analysis of our very simple testing algorithm involves several hairy inequalities. To facilitate this analysis, we give a complete characterization of a general class of inequalities- generalizing Cauchy-Schwarz, Holder´s inequality, and the monotonicity of L_p norms. Specifically, we characterize the set of sequences (a)_i = a₁, . . . , ar, (b)i = b_{1<- sub>, . . . , br, (c)i = c₁, ... , cr, for which it holds that for all finite sequences of positive numbers (x)_j = x₁,... and (y)_j = y₁,...,Π_i=1^r (Σ_jx^a_jⁱ_y_i^bⁱ)^ci≥1. For example, the standard Cauchy-Schwarz inequality corresponds to the sequences a = (1, 0, 1/2), b = (0,1, 1/2), c = (1/2 , 1/2 , -1). Our characterization is of a non-traditional nature in that it uses linear programming to compute a derivation that may otherwise have to be sought throu.gh trial and error, by hand. We do not believe such a characterization has appeared in the literature, and hope its computational nature will be useful to others, and facilitate analyses like the one here.}

Keywords :

computational complexity; linear programming; probability; theorem proving; L_p norm monotonicity; O(√n/ε²); O(√npolylog/n⁴) bound; automatic inequality prover; bounded L_2/3 norm; constant factors; distribution identity verification; domain elements; error parameters; finite positive number sequences; general inequalities; generalized Cauchy-Schwarz inequality; generalized Holder inequality; hairy inequalities; instance optimal identity testing; linear programming; probability; standard Cauchy-Schwarz inequality; testing algorithm; tight-known result matching; total variation distance; universal constants; unknown distribution parameters; upper-bounded function; Algorithm design and analysis; Complexity theory; Linear programming; Polynomials; Testing; Vectors; Automated Theorem Proving; Cauchy-Schwarz inequality; Identity Testing; Instance Optimal; Property Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on

Conference_Location :

Philadelphia, PA

ISSN :

0272-5428

Type :

conf

DOI :

10.1109/FOCS.2014.14

Filename :

6978989

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=180729