• DocumentCode
    2710541
  • Title

    Direct Zero-Norm Optimization for Feature Selection

  • Author

    Huang, Kaizhu ; King, Irwin ; Lyu, Michael R.

  • Author_Institution
    Dept. of Eng. Math., Univ. of Bristol, Bristol
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    845
  • Lastpage
    850
  • Abstract
    Zero-norm, defined as the number of non-zero elements in a vector, is an ideal quantity for feature selection. However, minimization of zero-norm is generally regarded as a combinatorially difficult optimization problem. In contrast to previous methods that usually optimize a surrogate of zero-norm, we propose a direct optimization method to achieve zero-norm for feature selection in this paper. Based on Expectation Maximization (EM), this method boils down to solving a sequence of Quadratic Programming problems and hence can be practically optimized in polynomial time. We show that the proposed optimization technique has a nice Bayesian interpretation and converges to the true zero norm asymptotically, provided that a good starting point is given. Following the scheme of our proposed zero-norm, we even show that an arbitrary-norm based Support Vector Machine can be achieved in polynomial time. A series of experiments demonstrate that our proposed EM based zero-norm outperforms other state-of-the-art methods for feature selection on biological microarray data and UCI data, in terms of both the accuracy and the learning efficiency.
  • Keywords
    expectation-maximisation algorithm; optimisation; support vector machines; UCI data; biological microarray data; direct zero-norm optimization; expectation maximization; feature selection; polynomial time; quadratic programming problems; support vector machine; Bayesian methods; Computer science; Data engineering; Data mining; Machine learning; Mathematics; Optimization methods; Polynomials; Quadratic programming; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.60
  • Filename
    4781189