DocumentCode :
1946856
Title :
New construction of Ensemble Classifiers for imbalanced datasets
Author :
Zhai, Yun ; Yang, Bingru ; Ma, Nan ; Ruan, Da
Author_Institution :
Sch. of Inf. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
fYear :
2010
fDate :
15-16 Nov. 2010
Firstpage :
228
Lastpage :
233
Abstract :
Learning in the presence of data imbalances presents a great challenge to machine learning. Imbalanced data sets represent a significant problem because the corresponding classifier has a tendency to ignore samples which have smaller representation in the training sets. In this paper, we propose an ensemble-based learning algorithm as a new ensemble classifier model called as SVM-C5.0 Ensemble Classifier Model, SCECM. SCECM adopts a differentiated sampling rate algorithm (DSRA) based on an improved Adaboost algorithm and further employs unique classifier-selection strategy, novel classifier integration approach and original classification decision-making method. Comparative experimental results show that the proposed approach improves performance for the minority class while preserving the ability to recognize examples from the majority classes.
Keywords :
learning (artificial intelligence); pattern classification; support vector machines; Adaboost algorithm; SCECM; SVM-C5.0; classification decision-making method; classifier integration approach; classifier-selection strategy; differentiated sampling rate algorithm; ensemble classifier model; ensemble-based learning algorithm; imbalanced datasets; machine learning; Accuracy; Classification algorithms; Data mining; Educational institutions; Machine learning; Support vector machines; Training; classification in imbalanced datasets; data mining; differentiated sampling rate; ensemble model of classifiers; heterogeneous classifier;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Knowledge Engineering (ISKE), 2010 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-6791-4
Type :
conf
DOI :
10.1109/ISKE.2010.5680874
Filename :
5680874
Link To Document :
بازگشت