DocumentCode :
1521390
Title :
Discriminative Semi-Supervised Feature Selection Via Manifold Regularization
Author :
Xu, Zenglin ; King, Irwin ; Lyu, Michael Rung-Tsong ; Jin, Rong
Author_Institution :
Max Planck Inst. for Inf., Saarland Univ., Saarbrucken, Germany
Volume :
21
Issue :
7
fYear :
2010
fDate :
7/1/2010 12:00:00 AM
Firstpage :
1033
Lastpage :
1047
Abstract :
Feature selection has attracted a huge amount of interest in both research and application communities of data mining. We consider the problem of semi-supervised feature selection, where we are given a small amount of labeled examples and a large amount of unlabeled examples. Since a small number of labeled samples are usually insufficient for identifying the relevant features, the critical problem arising from semi-supervised feature selection is how to take advantage of the information underneath the unlabeled data. To address this problem, we propose a novel discriminative semi-supervised feature selection method based on the idea of manifold regularization. The proposed approach selects features through maximizing the classification margin between different classes and simultaneously exploiting the geometry of the probability distribution that generates both labeled and unlabeled data. In comparison with previous semi-supervised feature selection algorithms, our proposed semi-supervised feature selection method is an embedded feature selection method and is able to find more discriminative features. We formulate the proposed feature selection method into a convex-concave optimization problem, where the saddle point corresponds to the optimal solution. To find the optimal solution, the level method, a fairly recent optimization method, is employed. We also present a theoretic proof of the convergence rate for the application of the level method to our problem. Empirical evaluation on several benchmark data sets demonstrates the effectiveness of the proposed semi-supervised feature selection method.
Keywords :
data mining; learning (artificial intelligence); optimisation; probability; benchmark data sets; convex concave optimization problem; data mining; discriminative semisupervised feature selection; manifold regularization; probability distribution; Feature selection; level method; manifold regularization; multiple kernel learning; semi-supervised learning; Algorithms; Artificial Intelligence; Computer Simulation; Data Mining; Decision Support Techniques; Humans; Learning;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2010.2047114
Filename :
5491193
Link To Document :
بازگشت