DocumentCode :
1784764
Title :
A hybrid Machine Learning methodology for imbalanced datasets
Author :
Lipitakis, Anastasia-Dimitra ; Kotsiantis, Sotirios
Author_Institution :
Dept. of Math., Univ. of Patras, Patra, Greece
fYear :
2014
fDate :
7-9 July 2014
Firstpage :
252
Lastpage :
257
Abstract :
In the Machine Learning systems several imbalanced data sets exhibit skewed class distributions in which most cases are allocated to a class and far fewer cases to a smaller one. A classifier induced from an imbalanced data set has usually a low error rate for the majority class and an unacceptable error rate for the minority class. In this paper a synoptic review of the various related methodologies is given, a new ensemble methodology is introduced and an experimental study with other ensembles is presented. The proposed method that combines the power of OverBagging and Rotation Forest algorithms improves the identification of a difficult small class, while keeping the classification ability of the other class in an acceptable accuracy level.
Keywords :
learning (artificial intelligence); pattern classification; OverBagging algorithms; classifier; ensemble methodology; hybrid machine learning methodology; imbalanced datasets; low error rate; rotation forest algorithms; skewed class distributions; Accuracy; Bagging; Classification algorithms; Decision trees; Learning systems; Principal component analysis; Training; computational intelligence; ensembles of classifiers; imbalanced data sets; supervised machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/IISA.2014.6878762
Filename :
6878762
Link To Document :
بازگشت