DocumentCode :
1728200
Title :
Optimality of coincidence-based goodness of fit test for sparse sample problems
Author :
Huang, Dayu ; Meyn, Sean
Author_Institution :
CSL & ECE, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2012
Firstpage :
344
Lastpage :
346
Abstract :
We consider the sparse sample goodness of fit problem, where the number of samples n is smaller than the size of the alphabet m. The generalized error exponent based on large deviation analysis was proposed to characterize the performance of tests, using the high-dimensional model in which both n and m tend to infinity and n = o(m). In previous work, the best achievable probability of error is shown to decay -log(Pe) = (n2/m)(1 + o(1))J with upper and lower bounds on J. However, there is a significant gap between the two bounds. In this paper, we close the gap by proving a tight upper-bound, which matches the lower-bound over the entire region of generalized error exponents of false alarm and missed detection, achieved by the coincidence-based test. This implies that the coincidence-based test is asymptotically optimal.
Keywords :
error statistics; sparse matrices; statistical testing; asymptotically optimal; best achievable error probability; coincidence-based goodness; coincidence-based test; false alarm; fit test; generalized error exponents; high-dimensional model; large deviation analysis; missed detection; optimality; sparse sample goodness; sparse sample problems; test performance; tight upper-bound; Analytical models; Biological system modeling; Convergence; Indexes; Materials; Probability; Speech processing; chi-square test; goodness of fit; high-dimensional model; large deviations; optimal test;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory and Applications Workshop (ITA), 2012
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1473-2
Type :
conf
DOI :
10.1109/ITA.2012.6181779
Filename :
6181779
Link To Document :
بازگشت