DocumentCode
1784764
Title
A hybrid Machine Learning methodology for imbalanced datasets
Author
Lipitakis, Anastasia-Dimitra ; Kotsiantis, Sotirios
Author_Institution
Dept. of Math., Univ. of Patras, Patra, Greece
fYear
2014
fDate
7-9 July 2014
Firstpage
252
Lastpage
257
Abstract
In the Machine Learning systems several imbalanced data sets exhibit skewed class distributions in which most cases are allocated to a class and far fewer cases to a smaller one. A classifier induced from an imbalanced data set has usually a low error rate for the majority class and an unacceptable error rate for the minority class. In this paper a synoptic review of the various related methodologies is given, a new ensemble methodology is introduced and an experimental study with other ensembles is presented. The proposed method that combines the power of OverBagging and Rotation Forest algorithms improves the identification of a difficult small class, while keeping the classification ability of the other class in an acceptable accuracy level.
Keywords
learning (artificial intelligence); pattern classification; OverBagging algorithms; classifier; ensemble methodology; hybrid machine learning methodology; imbalanced datasets; low error rate; rotation forest algorithms; skewed class distributions; Accuracy; Bagging; Classification algorithms; Decision trees; Learning systems; Principal component analysis; Training; computational intelligence; ensembles of classifiers; imbalanced data sets; supervised machine learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
Conference_Location
Chania
Type
conf
DOI
10.1109/IISA.2014.6878762
Filename
6878762
Link To Document