DocumentCode
3124170
Title
Is it rational to partition a data set using kernel-clustering?
Author
Sarkar, Kaushik ; Pal, Nikhil R.
Author_Institution
Dept. of Electron. & Commun. Eng., Narula Inst. of Technol., Kolkata, India
fYear
2011
fDate
27-30 June 2011
Firstpage
2600
Lastpage
2605
Abstract
Many interesting papers have been written in the recent past on kernel clustering and many attractive results have also been demonstrated. Here we question the rationality behind such clustering approaches. Using simple data sets we argue and demonstrate that it is not a good idea to find clusters in the kernel space when the objective is to look for clusters in the original data because in the kernel space the data may have a different geometry from that in the original feature space. In particular we demonstrate the following : (1) improper choice of the number of clusters may lead to very counterintuitive clusters (e.g., instead of merging nearby clusters, it may merge clusters that are far from each other) and (2) improper choice of kernel parameters has a significant effect on the extracted clusters and it can even impose arbitrary cluster structures that are undoubtedly absent in the original data. However, we definitely do not imply that kernel clustering can never produce desirable results. In fact, kernel clustering could be useful provided we can choose right kernel parameters. But the process being unsupervised, we do not have a solution to this issue yet. In this study, for illustration, we use one variant of the kernel Fuzzy C-Means (KFCM) clustering algorithm in conjunction with Polynomial kernels.
Keywords
data analysis; fuzzy set theory; pattern clustering; arbitrary cluster structures; data set; feature space; kernel fuzzy C-means clustering algorithm; kernel parameters; kernel space; Clustering algorithms; Correlation; Data mining; Kernel; Partitioning algorithms; Polynomials; Prototypes; FCM; Fuzzy C-means; Kernel fuzzy C-means; Kernel space; feature space;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location
Taipei
ISSN
1098-7584
Print_ISBN
978-1-4244-7315-1
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2011.6007685
Filename
6007685
Link To Document