• DocumentCode
    3151734
  • Title

    Feature selection for composite hypothesis testing with small samples: Fundamental limits and algorithms

  • Author

    Dayu Huang ; Meyn, Sean

  • Author_Institution
    CSL & ECE, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    1917
  • Lastpage
    1920
  • Abstract
    This paper considers the problem of feature selection for composite hypothesis testing: The goal is to select, from m candidate features, r relevant ones for distinguishing the null hypothesis from the composite alternative hypothesis; the training data are given as L sequences of observations, of which each is an n-sample sequence coming from one distribution in the alternative hypothesis. What is the fundamental limit for successful feature selection? Are there any algorithms that achieve this limit? We investigate this problem in a small-sample high-dimensional setting, with n = o(m), and obtain a tight pair of achievability and converse results: (i) There exists a function f(L, n, r,m) such that if f(L, n, r,m) ↓ 0, then no asymptotically consistent feature selection algorithm exists; (ii) We propose a feature selection algorithm that is asymptotically consistent whenever f(L, n, r,m) ↑ ∞.
  • Keywords
    computational complexity; learning (artificial intelligence); sampling methods; statistical testing; achievability; asymptotically consistent feature selection algorithm; composite alternative hypothesis; composite hypothesis testing; fundamental limit; n-sample sequence; null hypothesis; small-sample high-dimensional setting; supervised learning; training data; Algorithm design and analysis; Complexity theory; Reactive power; Testing; Training data; US Government; USA Councils; Feature selection; composite hypothesis testing; high-dimensional model; small sample; supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288279
  • Filename
    6288279