DocumentCode :
182907
Title :
Sampling strategies and sample redundancy in population-based fuzzy modeling and classification
Author :
Ruijun Dong
Author_Institution :
Dept. of Autom., Xidian Univ., Xi´an, China
fYear :
2014
fDate :
19-21 Aug. 2014
Firstpage :
94
Lastpage :
97
Abstract :
Training set sampling strategies are used in the context of population-based fuzzy modeling and classification to evaluate the quality of a data set. The objective is to explain the use of down-sampling strategies as a means for reducing the number of redundant samples. It is believed the clustering to be rather generally applicable to sampling in classification applications. The hypothesis is empirically validated by examining the performance of differential evolution fuzzy classifiers on Iris and Breast Cancer Wisconsin data sets. The learning curves of the classifiers are analyzed with respect to the choice of the sampling strategies. A nearly-optimal classification performance of the classifier is achieved using a relatively small training sample, showing that population-based fuzzy modeling and classification can be successfully applied to large data sets with affordable computation.
Keywords :
fuzzy set theory; genetic algorithms; particle swarm optimisation; pattern classification; DE; GA; Iris data set; PFMC; PSO; Wisconsin breast cancer data set; data set quality; differential evolution; genetic algorithm; particle swarm optimization; population-based fuzzy modelling and classification; sample redundancy; sampling strategies; Computational modeling; Iris; Redundancy; Signal processing algorithms; Sociology; Statistics; Training; down-sampling; population-based fuzzy modeling and classification; sample redundancy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
Type :
conf
DOI :
10.1109/FSKD.2014.6980813
Filename :
6980813
Link To Document :
بازگشت