Title of article :
EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling
Author/Authors :
Galar, Mikel and Fernández, Alberto and Barrenechea, Edurne and Herrera, Francisco
Issue Information :
Journal, serial issue, year 2013
Pages :
12
From page :
3460
To page :
3471
Abstract :
Classification with imbalanced data-sets has become one of the most challenging problems in Data Mining. When one class is much more heavily represented than the other, both the learning and classification processes suffer undesirable effects, mainly with regard to the minority class. Addressing this problem requires accurate tools; lately, ensembles of classifiers have emerged as a possible solution. Among ensemble proposals, the combination of Bagging and Boosting with preprocessing techniques has proved its ability to enhance the classification of the minority class. In this paper, we develop a new ensemble construction algorithm (EUSBoost) based on RUSBoost, one of the simplest and most accurate ensembles, which combines random undersampling with a Boosting algorithm. Our methodology aims to improve on existing proposals by enhancing the performance of the base classifiers through evolutionary undersampling. In addition, we promote diversity by favoring the use of different subsets of majority class instances to train each base classifier. Focusing on two-class highly imbalanced problems, we show, supported by a proper statistical analysis, that EUSBoost is able to outperform state-of-the-art ensemble-based methods. We also analyze its advantages using kappa-error diagrams, which we adapt to the imbalanced scenario.
Keywords :
Classification , Imbalanced data-sets , Ensembles , Class distribution , Kappa-error diagrams , Boosting
Journal title :
PATTERN RECOGNITION
Serial Year :
2013
Record number :
1735719
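Illustrative note :
The abstract describes RUSBoost as combining random undersampling with Boosting, and says that training each base classifier on a different subset of majority class instances promotes diversity. The sketch below illustrates the undersampling step only. The function name `random_undersample`, the toy data, the 1:1 target ratio, and the per-round seeds are all assumptions made for illustration. It is not the authors' EUSBoost, which replaces the random draw with evolutionary undersampling, and it does not model boosting weights.

```python
import random
from collections import Counter

def random_undersample(X, y, minority_label, rng):
    """Keep every minority instance and an equally sized random subset of the rest."""
    minority = [i for i, label in enumerate(y) if label == minority_label]
    majority = [i for i, label in enumerate(y) if label != minority_label]
    # Sample without replacement; cap at the majority size in case it is smaller.
    kept_majority = rng.sample(majority, min(len(minority), len(majority)))
    keep = sorted(minority + kept_majority)
    return [X[i] for i in keep], [y[i] for i in keep]

# Toy data (assumed): 3 minority instances (label 1) and 27 majority instances (label 0).
X = [[float(i)] for i in range(30)]
y = [1 if i < 3 else 0 for i in range(30)]

# One balanced subset per hypothetical boosting round, each drawn with its own seed,
# so different rounds can see different majority class instances.
subsets = [random_undersample(X, y, 1, random.Random(seed)) for seed in range(3)]
for X_sub, y_sub in subsets:
    print(Counter(y_sub))
```

Each subset keeps all minority instances and only as many majority instances as needed to balance the classes, so every hypothetical round trains on a small, balanced sample.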