Title :
Feature selection using random probes and linear support vector machines
Author :
Chi, Hoi-Ming ; Ersoy, Okan K. ; Moskowitz, Herbert
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN
Abstract :
A novel feature selection algorithm that combines the ideas of linear support vector machines (SVMs) and random probes is proposed. A random probe is first artificially generated from a Gaussian distribution and appended to the data set as an extra input variable. Next, a standard 2-norm or 1-norm linear support vector machine is trained using this new data set. Each coefficient, or weight, in a linear SVM is compared to that of the random probe feature. Under several statistical assumptions, the probability of each input feature being more relevant than the random probe can be computed easily. The proposed feature selection method is intuitive to use in real-world problems, and it automatically determines the optimal number of features needed. It can also be extended to selecting significant interaction and/or quadratic terms in a 2nd-order polynomial representation
Keywords :
Gaussian distribution; feature extraction; random processes; support vector machines; 2nd-order polynomial representation; Gaussian distribution; feature selection; linear support vector machines; random probes; statistical assumptions; Data mining; Gaussian distribution; Input variables; Machine learning algorithms; Polynomials; Probability; Probes; Quadratic programming; Support vector machines; USA Councils;
Conference_Titel :
Computational Intelligence Methods and Applications, 2005 ICSC Congress on
Conference_Location :
Istanbul
Print_ISBN :
1-4244-0020-1
DOI :
10.1109/CIMA.2005.1662318