DocumentCode
3117153
Title
Comparison of scaling behavior between fuzzy c-means based classifier with many parameters and LibSVM
Author
Ichihashi, Hidetomo ; Honda, Katsuhiro ; Notsu, Akira
Author_Institution
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
fYear
2011
fDate
27-30 June 2011
Firstpage
386
Lastpage
393
Abstract
This paper reports the scaling behavior of the fuzzy c-means based classifier (FCMC) with many parameters. FCMC is a classifier based on clustering approaches. The classification accuracy on test sets (i.e., the generalization capability) is not necessarily improved by increasing the number of clusters. Especially when the number of training samples is relatively small, not only the classification boundary over-fits the data, but also covariance matrices and cluster centers are computed incorrectly, since the number of samples in each cluster becomes smaller. Hence, the test set accuracy deteriorates. The performance of FCMC with two clusters in each class and the number of training samples less than 1000, was reported in the literature. This paper reports the scaling behavior of FCMC by testing with variously-sized training samples. The number of clusters of FCMC is increased up to eight. The number of clusters used in this paper is not very large but the number of parameters is relatively large. So, the parameters are optimized to training sets. LibSVM is one of the widely known state of the art tools for support vector machines (SVM). The test set accuracy, training time and testing time (i.e., the detection time) of FCMC are compared with LibSVM by varying the size of training sets. FCMC shows a good generalization capability, though the parameters are optimized to training sets. When the number of training samples is increased by 10 times, the training time of FCMC increases by 10 times, but that of LibSVM increases by a factor of 100. The testing time is also much shorter than LibSVM when the size of the training set is large.
Keywords
covariance matrices; fuzzy set theory; pattern classification; pattern clustering; support vector machines; FCMC; LibSVM; classification boundary; cluster centers; clustering approach; covariance matrices; fuzzy c-means based classifier; scaling behavior testing; support vector machine; Accuracy; Covariance matrix; Error analysis; Principal component analysis; Support vector machines; Training; classifier; clustering; large data set; scaling behavior;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location
Taipei
ISSN
1098-7584
Print_ISBN
978-1-4244-7315-1
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2011.6007352
Filename
6007352
Link To Document