• DocumentCode
    3124170
  • Title

    Is it rational to partition a data set using kernel-clustering?

  • Author

    Sarkar, Kaushik ; Pal, Nikhil R.

  • Author_Institution
    Dept. of Electron. & Commun. Eng., Narula Inst. of Technol., Kolkata, India
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    2600
  • Lastpage
    2605
  • Abstract
    Many interesting papers have been written in the recent past on kernel clustering and many attractive results have also been demonstrated. Here we question the rationality behind such clustering approaches. Using simple data sets we argue and demonstrate that it is not a good idea to find clusters in the kernel space when the objective is to look for clusters in the original data because in the kernel space the data may have a different geometry from that in the original feature space. In particular we demonstrate the following : (1) improper choice of the number of clusters may lead to very counterintuitive clusters (e.g., instead of merging nearby clusters, it may merge clusters that are far from each other) and (2) improper choice of kernel parameters has a significant effect on the extracted clusters and it can even impose arbitrary cluster structures that are undoubtedly absent in the original data. However, we definitely do not imply that kernel clustering can never produce desirable results. In fact, kernel clustering could be useful provided we can choose right kernel parameters. But the process being unsupervised, we do not have a solution to this issue yet. In this study, for illustration, we use one variant of the kernel Fuzzy C-Means (KFCM) clustering algorithm in conjunction with Polynomial kernels.
  • Keywords
    data analysis; fuzzy set theory; pattern clustering; arbitrary cluster structures; data set; feature space; kernel fuzzy C-means clustering algorithm; kernel parameters; kernel space; Clustering algorithms; Correlation; Data mining; Kernel; Partitioning algorithms; Polynomials; Prototypes; FCM; Fuzzy C-means; Kernel fuzzy C-means; Kernel space; feature space;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-7315-1
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2011.6007685
  • Filename
    6007685