• DocumentCode
    3117153
  • Title

    Comparison of scaling behavior between fuzzy c-means based classifier with many parameters and LibSVM

  • Author

    Ichihashi, Hidetomo ; Honda, Katsuhiro ; Notsu, Akira

  • Author_Institution
    Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    386
  • Lastpage
    393
  • Abstract
    This paper reports the scaling behavior of the fuzzy c-means based classifier (FCMC) with many parameters. FCMC is a classifier based on clustering approaches. The classification accuracy on test sets (i.e., the generalization capability) is not necessarily improved by increasing the number of clusters. Especially when the number of training samples is relatively small, not only the classification boundary over-fits the data, but also covariance matrices and cluster centers are computed incorrectly, since the number of samples in each cluster becomes smaller. Hence, the test set accuracy deteriorates. The performance of FCMC with two clusters in each class and the number of training samples less than 1000, was reported in the literature. This paper reports the scaling behavior of FCMC by testing with variously-sized training samples. The number of clusters of FCMC is increased up to eight. The number of clusters used in this paper is not very large but the number of parameters is relatively large. So, the parameters are optimized to training sets. LibSVM is one of the widely known state of the art tools for support vector machines (SVM). The test set accuracy, training time and testing time (i.e., the detection time) of FCMC are compared with LibSVM by varying the size of training sets. FCMC shows a good generalization capability, though the parameters are optimized to training sets. When the number of training samples is increased by 10 times, the training time of FCMC increases by 10 times, but that of LibSVM increases by a factor of 100. The testing time is also much shorter than LibSVM when the size of the training set is large.
  • Keywords
    covariance matrices; fuzzy set theory; pattern classification; pattern clustering; support vector machines; FCMC; LibSVM; classification boundary; cluster centers; clustering approach; covariance matrices; fuzzy c-means based classifier; scaling behavior testing; support vector machine; Accuracy; Covariance matrix; Error analysis; Principal component analysis; Support vector machines; Training; classifier; clustering; large data set; scaling behavior;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-7315-1
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2011.6007352
  • Filename
    6007352