Title :
Feature/Model Selection by the Linear Programming SVM Combined with State-of-Art Classifiers: What Can We Learn About the Data
Author :
Pranckeviciene, Erinija ; Somorjai, Ray ; Tran, Muoi N.
Author_Institution :
Nat. Res. Council Canada, Winnipeg
Abstract :
Many real-world classification problems are represented by very sparse and high-dimensional data. The recent successes of a linear programming support vector machine (LPSVM) for feature selection motivated a deeper analysis of the method when applied to sparse, multivariate data. Due to the sparseness, the selection of a classification model is greatly influenced by the characteristics of that particular dataset. In this study, we investigate a feature selection strategy based on LPSVM as the initial feature filter, combined with state-of-art classification rules, and apply to five real-life datasets of the agnostic learning vs. prior knowledge challenge of IJCNN2007. Our goal is to better understand the robustness of LPSVM as a feature filter. Our analysis suggests that LPSVM can be a useful black box method for identification of the profile of the informative features in the data. If the data are complex and better separable by nonlinear methods, then feature pre-filtering by LPSVM enhances the data representation for other classifiers.
Keywords :
data structures; linear programming; support vector machines; agnostic learning; black box method; feature pre-filtering; feature-model selection; high-dimensional data representation; linear programming; linear programming SVM; nonlinear methods; prior knowledge challenge; profile identification; real-world classification problems; sparse multivariate data; state-of-art classifiers; support vector machine; Data analysis; Error analysis; Filters; Linear programming; Neural networks; Robustness; Support vector machine classification; Support vector machines; Testing; Throughput;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371201