Title :
Semi_Fisher Score: A semi-supervised method for feature selection
Author :
Yang, Ming ; Chen, Yin-juan ; Ji, Gen-lin
Author_Institution :
Sch. of Comput. Sci. & Technol., Nanjing Normal Univ., Nanjing, China
Abstract :
Feature selection is an important problem for pattern classifier systems. As compared to unsupervised feature selection methods, supervised feature selection approaches have better performance when the given training samples with supervised information are sufficient. However, in reality, usually only a few labeled data are obtained, since obtaining class labels is expensive but many unlabeled data can be easily gotten. For this case, directly using the existing supervised feature selection algorithms may be failed because the data distribution may not be accurately estimated only by using a few labeled data. So, in this paper, we introduce a semi-supervised method for feature selection, called Semi_Fisher Score, the new model attempts to effectively simultaneously utilize all labeled and unlabeled samples for improving the performance of the classical Fisher Score. Experiments on 4 UCI datasets by using three different classifiers(KNN, RBFNN and C4.5)show the effectiveness of our algorithm.
Keywords :
data analysis; pattern classification; radial basis function networks; unsupervised learning; C4.5; KNN; RBFNN; Semi_Fisher Score; UCI datasets; classical Fisher score; data distribution; pattern classifier systems; semisupervised method; supervised feature selection algorithms; supervised information; unlabeled data; unsupervised feature selection methods; Accuracy; Algorithm design and analysis; Classification algorithms; Feature extraction; Machine learning; Manifolds; Vehicles; Feature Selection; Fisher Score; Semi-supervised feature selection;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
DOI :
10.1109/ICMLC.2010.5581007