Title :
An experimental comparison of MES aggregation rules in case of imbalanced datasets
Author_Institution :
Integrated Res. Centre, Univ. Campus Bio-Medico of Rome, Rome, Italy
Abstract :
Learning under imbalanced dataset can be difficult since traditional algorithms are biased towards the majority class, providing low predictive accuracy over the minority one. Among the several methods proposed in the literature to overcome such a limitation, the most recent uses multi-experts system (MES) composed of balanced classifiers, whose decisions are aggregated according to a combination rule. Each classifier of the MES is trained with a balanced subset of the original training set, which can be determined applying different division methods. This paper explores how different MES combination rules perform with imbalanced TS, experimentally comparing fusion and selection combination criteria. Furthermore, two methods have been used to divide the original TS, namely random selection and clustering. The results confirm and extend previous findings reported in the literature showing that, on the one side, combination rules belonging to selection framework outperform the others and, on the other side, dividing the original training set via random selection rather than clustering permits to attain better performance.
Keywords :
data handling; expert systems; pattern classification; balanced classifiers; combination rules; imbalanced dataset; multi-expert system aggregation rules; random selection; Accuracy; Algorithm design and analysis; Back; Data mining; Diseases; Machine learning; Machine learning algorithms; Medical diagnosis; Medical diagnostic imaging; Text categorization;
Conference_Titel :
Computer-Based Medical Systems, 2009. CBMS 2009. 22nd IEEE International Symposium on
Conference_Location :
Albuquerque, NM
Print_ISBN :
978-1-4244-4879-1
Electronic_ISBN :
1063-7125
DOI :
10.1109/CBMS.2009.5255382