• DocumentCode
    2785951
  • Title

    Principal Component Analysis based Feature Selection for clustering

  • Author

    Xu, Jun-ling ; Xu, Bao-wen ; Zhang, Wei-feng ; Cui, Zi-feng

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing
  • Volume
    1
  • fYear
    2008
  • fDate
    12-15 July 2008
  • Firstpage
    460
  • Lastpage
    465
  • Abstract
    Feature extraction (FE) methods have been proved to be very effective for dimension reduction, but the features attained are meaningless. In order to exploit the effectiveness of FE methods to support feature selection (FS), this paper proposed a new FS approach for clustering based on principal component analysis (PCA) called PS. It first uses PCA to transform the data from original feature space into a new feature space whose features are linear combination of the original ones, and then evaluates the importance of the original features based on the newly generated features and the feature importance measure proposed in this paper, finally selects features incrementally according to their importance to improve the performance of the clustering algorithm. Experiment is carried out on several popular data sets and the results show the advantages of the proposed approach.
  • Keywords
    feature extraction; pattern clustering; principal component analysis; clustering algorithm; dimension reduction; feature extraction; feature importance measure; feature selection; principal component analysis; Clustering algorithms; Cybernetics; Extraterrestrial measurements; Feature extraction; Iron; Linear discriminant analysis; Machine learning; Principal component analysis; Supervised learning; Unsupervised learning; Clustering; Feature selection; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2008 International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2095-7
  • Electronic_ISBN
    978-1-4244-2096-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2008.4620449
  • Filename
    4620449