• DocumentCode
    3345834
  • Title

    A GA-Based Feature Selection for High-Dimensional Data Clustering

  • Author

    Sun, Mei ; Xiong, Langhuan ; Sun, Haojun ; Jiang, Dazhi

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Shantou Univ., Shantou, China
  • fYear
    2009
  • fDate
    14-17 Oct. 2009
  • Firstpage
    769
  • Lastpage
    772
  • Abstract
    High-dimensional data clustering is an open problem in modern data mining. This paper proposed a new genetic algorithm-based feature selection for high-dimensional data clustering, called GA-FSFclustering. This approach searches effective feature subsets for clustering in all features by genetic algorithm. The candidate features and cluster centers are real number encoded. A new criterion for evaluating feature subsets is employed as the fitness function. The experimental results indicate the feasibility and efficiency of the GA-FSFclustering algorithm.
  • Keywords
    data mining; genetic algorithms; pattern clustering; GA-FSFclustering; data mining; feature selection; genetic algorithm; high-dimensional data clustering; Algorithm design and analysis; Clustering algorithms; Computer science; Data mining; Encoding; Genetic algorithms; Greedy algorithms; Information analysis; Information security; Sun; clustering; feature selection; genetic algorithms; high-dimensional data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genetic and Evolutionary Computing, 2009. WGEC '09. 3rd International Conference on
  • Conference_Location
    Guilin
  • Print_ISBN
    978-0-7695-3899-0
  • Type

    conf

  • DOI
    10.1109/WGEC.2009.140
  • Filename
    5402823