Title :
A comparative study of redundant feature detection based feature selection methods
Author :
Xue-Qiang Zeng ; Qian-Sheng Chen
Author_Institution :
Key Lab. of Embedded Syst. & Service Comput., Tongji Univ., Shanghai, China
Abstract :
As a high dimensional problem, analysis of largescale data sets is a challenging task, where many weakly relevant or redundant features hurt generalization performance of classification models. In order to solve this problem, many effective feature selection methods have proposed to eliminate redundant features in recent years. However, the comparative performances of these redundant feature detection based methods have not been reported yet, which makes the choice of feature selection method relatively difficult for many real applications. The paper presents a novel comparative study of redundant feature detection based feature selection methods. Experiments on several benchmark data sets demonstrate the comparative performances of some state-of-the-arts methods. Based on the extensive empirical results, the minimum Redundancy-Maximum Relevance (mRMR) method has been found to be the best one among all compared feature selection models.
Keywords :
feature selection; pattern classification; redundancy; benchmark data sets; classification models; empirical analysis; generalization performance; high-dimensional problem; large-scale data set analysis; mRMR method; minimum redundancy-maximum relevance method; redundant feature detection-based feature selection method; Breast; Colon; Lungs; Redundancy; Comparative Study; Feature Selection; Redundant Feature;
Conference_Titel :
Computer, Information and Telecommunication Systems (CITS), 2014 International Conference on
Conference_Location :
Jeju
Print_ISBN :
978-1-4799-4384-5
DOI :
10.1109/CITS.2014.6878974