DocumentCode :
3730452
Title :
Ensemble unsupervised feature selection based on permutation and R-value
Author :
Xiaomei Wang; Xiaohui Lin; Xin Huang; Yuansheng Yang
Author_Institution :
School of Computer Science and Technology, Dalian University of Technology, China
fYear :
2015
Firstpage :
795
Lastpage :
800
Abstract :
Selecting the informative features from the high dimensional data can improve the performance of the classification and get a deep understanding of the problems. A non-problem related feature contains little information and has little influence on the data distribution. By permuting the feature and calculating the data distribution difference, how much information the feature contains could be measured. In this paper, we propose an unsupervised feature selection method (EUFSPR), which combines the ensemble technique, clustering, permutation and data distribution evaluation techniques to measure the feature importance. Clustering is adopted to get the sample groups and the data distribution is evaluated by the overlapping areas. Eight gene expression microarray datasets are utilized to demonstrate the effectiveness of the proposed method over the unsupervised feature selection methods and supervised feature selection methods.
Keywords :
"Clustering algorithms","Indexes","Support vector machines","Classification algorithms","Unsupervised learning","Computer science","Algorithm design and analysis"
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type :
conf
DOI :
10.1109/FSKD.2015.7382044
Filename :
7382044
Link To Document :
بازگشت