DocumentCode
3151734
Title
Feature selection for composite hypothesis testing with small samples: Fundamental limits and algorithms
Author
Dayu Huang ; Meyn, Sean
Author_Institution
CSL & ECE, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
1917
Lastpage
1920
Abstract
This paper considers the problem of feature selection for composite hypothesis testing: The goal is to select, from m candidate features, r relevant ones for distinguishing the null hypothesis from the composite alternative hypothesis; the training data are given as L sequences of observations, of which each is an n-sample sequence coming from one distribution in the alternative hypothesis. What is the fundamental limit for successful feature selection? Are there any algorithms that achieve this limit? We investigate this problem in a small-sample high-dimensional setting, with n = o(m), and obtain a tight pair of achievability and converse results: (i) There exists a function f(L, n, r,m) such that if f(L, n, r,m) ↓ 0, then no asymptotically consistent feature selection algorithm exists; (ii) We propose a feature selection algorithm that is asymptotically consistent whenever f(L, n, r,m) ↑ ∞.
Keywords
computational complexity; learning (artificial intelligence); sampling methods; statistical testing; achievability; asymptotically consistent feature selection algorithm; composite alternative hypothesis; composite hypothesis testing; fundamental limit; n-sample sequence; null hypothesis; small-sample high-dimensional setting; supervised learning; training data; Algorithm design and analysis; Complexity theory; Reactive power; Testing; Training data; US Government; USA Councils; Feature selection; composite hypothesis testing; high-dimensional model; small sample; supervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288279
Filename
6288279
Link To Document