• DocumentCode
    1785131
  • Title

    A comparative study of redundant feature detection based feature selection methods

  • Author

    Xue-Qiang Zeng ; Qian-Sheng Chen

  • Author_Institution
    Key Lab. of Embedded Syst. & Service Comput., Tongji Univ., Shanghai, China
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    As a high dimensional problem, analysis of largescale data sets is a challenging task, where many weakly relevant or redundant features hurt generalization performance of classification models. In order to solve this problem, many effective feature selection methods have proposed to eliminate redundant features in recent years. However, the comparative performances of these redundant feature detection based methods have not been reported yet, which makes the choice of feature selection method relatively difficult for many real applications. The paper presents a novel comparative study of redundant feature detection based feature selection methods. Experiments on several benchmark data sets demonstrate the comparative performances of some state-of-the-arts methods. Based on the extensive empirical results, the minimum Redundancy-Maximum Relevance (mRMR) method has been found to be the best one among all compared feature selection models.
  • Keywords
    feature selection; pattern classification; redundancy; benchmark data sets; classification models; empirical analysis; generalization performance; high-dimensional problem; large-scale data set analysis; mRMR method; minimum redundancy-maximum relevance method; redundant feature detection-based feature selection method; Breast; Colon; Lungs; Redundancy; Comparative Study; Feature Selection; Redundant Feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer, Information and Telecommunication Systems (CITS), 2014 International Conference on
  • Conference_Location
    Jeju
  • Print_ISBN
    978-1-4799-4384-5
  • Type

    conf

  • DOI
    10.1109/CITS.2014.6878974
  • Filename
    6878974