Title of article
Combining instance selection methods based on data characterization: An approach to increase their effectiveness
Author/Authors
Yoel Caises، نويسنده , , Antonio Gonz?lez، نويسنده , , Enrique Leyva، نويسنده , , Ra?l Pérez، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2011
Pages
19
From page
4780
To page
4798
Abstract
Although there are several proposals in the instance selection field, none of them consistently outperforms the others over a wide range of domains. In recent years many authors have come to the conclusion that data must be characterized in order to apply the most suitable selection criterion in each case. In light of this hypothesis, herein we propose a set of measures to characterize databases. These measures were used in decision rules which, given their values for a database, select from some pre-selected methods, the method, or combination of methods, that is expected to produce the best results. The rules were extracted based on an empirical analysis of the behaviors of several methods on several data sets, then integrated into an algorithm which was experimentally evaluated over 20 databases and with six different learning paradigms. The results were compared with those of five well-known state-of-the-art methods.
Keywords
Prototype selection , Instance selection , data reduction , Machine Learning
Journal title
Information Sciences
Serial Year
2011
Journal title
Information Sciences
Record number
1214704
Link To Document