• DocumentCode
    3107573
  • Title

    Clustering data set with categorical feature using multi objective genetic algorithm

  • Author

    Dutta, Dipankar ; Dutta, Paramartha ; Sil, Jaya

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Technol., Univ. Inst. of Technol., Golapbug, India
  • fYear
    2012
  • fDate
    18-20 July 2012
  • Firstpage
    103
  • Lastpage
    108
  • Abstract
    In the paper, real coded multi objective genetic algorithm based K-clustering method has been studied where K represents the number of clusters known apriori. The searching power of Genetic Algorithm (GA) is exploited to search for suitable clusters and cluster modes so that intra-cluster distance (Homogeneity, H) and inter-cluster distances (Separation, S) are simultaneously optimized. It is achieved by measuring H and S using Mod distance per feature metric, suitable for categorical features (attributes). We have selected 3 benchmark data sets from UCI Machine Learning Repository containing categorical features only. Here, K-modes is hybridized with GA to combine global searching capabilities of GA with local searching capabilities of K-modes. Considering context sensitivity, we have used a special crossover operator called “pairwise crossover” and “substitution”.
  • Keywords
    data mining; genetic algorithms; learning (artificial intelligence); pattern clustering; GA; K-clustering method; K-modes; Mod distance; UCI machine learning repository; categorical feature; categorical features; cluster modes; context sensitivity; crossover operator; data set clustering; global searching capabilities; intercluster distances; intracluster distance; local searching capabilities; pairwise crossover; real coded multiobjective genetic algorithm; substitution; Biological cells; Clustering algorithms; Genetic algorithms; Indexes; Optimization; Sociology; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Science & Engineering (ICDSE), 2012 International Conference on
  • Conference_Location
    Cochin, Kerala
  • Print_ISBN
    978-1-4673-2148-8
  • Type

    conf

  • DOI
    10.1109/ICDSE.2012.6281897
  • Filename
    6281897