DocumentCode
637091
Title
On the probability of feature selection in support vector classification
Author
Qunfeng Liu ; Lan Yao
Author_Institution
Coll. of Comput., Dongguan Univ. of Technol., Dongguan, China
fYear
2013
fDate
28-30 July 2013
Firstpage
334
Lastpage
339
Abstract
Feature selection is important for classification problem, especially when the number of features is very large or noisiness is present in data. Support vector machine (SVM) with Lp regularization is a popular approach for feature selection. Many researches have devoted to develop efficient methods to solve the optimization problem in support vector machine. However, to our knowledge, there is still no formal proof or comprehensive mathematical understanding on how Lp regularization can bring feature selection. In this paper, we first show that feature selection depends not only the parameter p but also the data itself. If the feasible region generated from the data lies faraway relatively from the coordinates, then feature selection maybe impossible for any p. Otherwise, a small p can help to enhance the ability of feature selection of Lp-SVM. Then we provide a formula for computing the probabilities which measure the feature selection ability. The only assumption is that the optimal solutions of all possible classification problems distribute uniformly on the contour of the objective function. Based on this formula, we compute the probabilities for some popular p.
Keywords
feature extraction; optimisation; pattern classification; probability; support vector machines; Lp regularization; Lp-SVM; classification problem; feature selection; optimal solutions; optimization problem; probability; support vector classification; support vector machine; Educational institutions; Electronic mail; Linear programming; Optimization; Size measurement; Support vector machines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Service Operations and Logistics, and Informatics (SOLI), 2013 IEEE International Conference on
Conference_Location
Dongguan
Print_ISBN
978-1-4799-0529-4
Type
conf
DOI
10.1109/SOLI.2013.6611436
Filename
6611436
Link To Document