DocumentCode
182907
Title
Sampling strategies and sample redundancy in population-based fuzzy modeling and classification
Author
Ruijun Dong
Author_Institution
Dept. of Autom., Xidian Univ., Xi´an, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
94
Lastpage
97
Abstract
Training set sampling strategies are used in the context of population-based fuzzy modeling and classification to evaluate the quality of a data set. The objective is to explain the use of down-sampling strategies as a means for reducing the number of redundant samples. It is believed the clustering to be rather generally applicable to sampling in classification applications. The hypothesis is empirically validated by examining the performance of differential evolution fuzzy classifiers on Iris and Breast Cancer Wisconsin data sets. The learning curves of the classifiers are analyzed with respect to the choice of the sampling strategies. A nearly-optimal classification performance of the classifier is achieved using a relatively small training sample, showing that population-based fuzzy modeling and classification can be successfully applied to large data sets with affordable computation.
Keywords
fuzzy set theory; genetic algorithms; particle swarm optimisation; pattern classification; DE; GA; Iris data set; PFMC; PSO; Wisconsin breast cancer data set; data set quality; differential evolution; genetic algorithm; particle swarm optimization; population-based fuzzy modelling and classification; sample redundancy; sampling strategies; Computational modeling; Iris; Redundancy; Signal processing algorithms; Sociology; Statistics; Training; down-sampling; population-based fuzzy modeling and classification; sample redundancy;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980813
Filename
6980813
Link To Document