DocumentCode :
3121610
Title :
A comparison of distance-based semi-supervised fuzzy c-means clustering algorithms
Author :
Lai, Daphne Teck Ching ; Garibaldi, Jonathan M.
Author_Institution :
Sch. of Comput. Sci., Univ. of Nottingham, Nottingham, UK
fYear :
2011
fDate :
27-30 June 2011
Firstpage :
1580
Lastpage :
1586
Abstract :
There are many issues to be considered in the design of distance-based fuzzy semi-supervised clustering (FSSC) algorithms. To identify these issues, we compare the performance of four such algorithms. We describe the properties of these algorithms, highlighting their key differences, and then experimentally compare their performance on common datasets. Several experimental conditions are investigated. Firstly, two forms of initialisation of the membership values of unlabelled patterns are used; 1/c and 0. Secondly, the algorithms are run with varying proportions of labelled patterns in the datasets, ranging from 2% to 40%. We find that no algorithm outperforms the others in all the datasets. We also observe that small modifications in similar objective functions can improve clustering, and that most of the algorithms perform slightly better with zero initialisation of unlabelled patterns. An interesting observation is that the increase in labelled patterns does not always improve clustering. From these results, we conclude that the number and scale of dimensions in the data set, initial partition matrix, distance metrics and objective functions, together, affect clustering results. In addition, we conclude that not all initially labelled patterns are good candidates for supervision.
Keywords :
fuzzy set theory; learning (artificial intelligence); pattern clustering; distance based semisupervised fuzzy c-means clustering algorithm; distance metrics; initial partition matrix; labelled patterns; membership values; objective functions; unlabelled patterns; Algorithm design and analysis; Clustering algorithms; Euclidean distance; Kernel; Partitioning algorithms; Prototypes; Fuzzy c-means; semi-supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1098-7584
Print_ISBN :
978-1-4244-7315-1
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2011.6007562
Filename :
6007562
Link To Document :
بازگشت