• DocumentCode
    2160570
  • Title

    Simultaneous continuous feature selection and K clustering by Multi Objective Genetic Algorithm

  • Author

    Dutta, D. ; Dutta, Pranab ; Sil, J.

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Technol., Univ. of Burdwan, Burdwan, India
  • fYear
    2013
  • fDate
    22-23 Feb. 2013
  • Firstpage
    937
  • Lastpage
    942
  • Abstract
    We can classify clustering into two categories. In K Clustering, we know the number of clusters or K. In other category of clustering, K in unknown. In this paper we have considered the first category only. We can broadly classify features within a data set into continuous and categorical. Here we have considered data set with continuous features only. Clustering can be done by all features or by relevant features only. Researches had commonly used some feature selection techniques to select relevant features for clustering and then did clustering by some clustering algorithm. Here we have used Multi Objective Genetic Algorithm (MOGA) for simultaneous feature selection and clustering. Here, K-means is hybridized with GA. We have used hybridized GA to combine global searching abilities of GA with local searching abilities of K-means. Considering context sensitivity, we have used a special crossover operator called “pairwise crossover” and “substitution”. Elimination of redundant, irrelevant features increases clustering performance, reflected in MOGA Feature Selection (H, S) compared with MOGA (H, S). The main contribution of this paper is simultaneous dimensionality reduction and optimization of objectives using MOGA.
  • Keywords
    genetic algorithms; pattern classification; pattern clustering; MOGA feature selection; categorical feature; clustering classification; clustering performance; continuous feature; feature clustering; k clustering; multiobjective genetic algorithm; pairwise crossover operator; simultaneous continuous feature selection; substitution operator; Biological cells; Clustering algorithms; Equations; Genetic algorithms; Mathematical model; Sociology; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advance Computing Conference (IACC), 2013 IEEE 3rd International
  • Conference_Location
    Ghaziabad
  • Print_ISBN
    978-1-4673-4527-9
  • Type

    conf

  • DOI
    10.1109/IAdCC.2013.6514352
  • Filename
    6514352