• DocumentCode
    1774007
  • Title

    A novel feature selection by clustering coefficients of variations

  • Author

    Fong, Simon ; Liang, Justin ; Wong, Rita ; Ghanavati, Mojgan

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Univ. of Macau, Taipa, China
  • fYear
    2014
  • fDate
    Sept. 29 2014-Oct. 1 2014
  • Firstpage
    205
  • Lastpage
    213
  • Abstract
    One of the challenges in inferring a classification model with good prediction accuracy is to select the relevant features that contribute to maximum predictive power. Many feature selection techniques have been proposed and studied in the past, but none so far claimed to be the best. In this paper, a novel and efficient feature selection method called Clustering Coefficients of Variation (CCV) is proposed. CCV is based on a very simple principle of variance-basis which finds an optimal balance between generalization and overfitting. Through a computer simulation experiment, 44 datasets with substantially large number of features are tested by CCV in comparison to four popular feature selection techniques. Results show that CCV outperformed them in all aspects of averaged performances and speed. By the simplicity of design it is anticipated that CCV will be a useful alternative of pre-processing method for classification especially with those datasets that are characterized by many features.
  • Keywords
    feature selection; pattern classification; pattern clustering; statistical analysis; CCV; classification model; clustering coefficients of variations; feature selection; variance-basis principle; Accuracy; Complexity theory; Computational modeling; Correlation; Predictive models; Standards; Training; Classification; Data Mining; Feature Selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information Management (ICDIM), 2014 Ninth International Conference on
  • Conference_Location
    Phitsanulok
  • Type

    conf

  • DOI
    10.1109/ICDIM.2014.6991429
  • Filename
    6991429