DocumentCode
1805211
Title
Improved SMOTEBagging and its application in imbalanced data classification
Author
Zhang Yongqing ; Zhu Min ; Zhang Danling ; Mi Gang ; Ma Daichuan
Author_Institution
School of Computer Science, Sichuan University, Chengdu, China
fYear
2013
fDate
1-8 Jan. 2013
Firstpage
1
Lastpage
5
Abstract
Many real world data mining applications involve imbalanced data sets, When all kinds of data are unevenly distributed and the particular evens of interest may be very few when compared to the other class. Data sets that contain rare evens usually produces biased classifiers that have a higher predictive accuracy over the majority class, but poorer predictive accuracy over the minority class of interest. This paper presents a novel ensemble algorithm with improved SMOTE, and combines selective ensemble with Bagging, which balances the class distribution with improved SMOTEBagging algorithm. Experiments on four UCI data sets and protein-protein interaction experiments mentioned above prove the performance of the method.
Keywords
Bioinformatics; Classification algorithms; Proteins; Support vector machine classification; Tin; Bagging; Imbalanced Datasets; SMOTE; SVM;
fLanguage
English
Publisher
ieee
Conference_Titel
Conference Anthology, IEEE
Conference_Location
China
Type
conf
DOI
10.1109/ANTHOLOGY.2013.6784957
Filename
6784957
Link To Document