• Title of article

    Combining instance selection methods based on data characterization: An approach to increase their effectiveness

  • Author/Authors

    Yoel Caises، نويسنده , , Antonio Gonz?lez، نويسنده , , Enrique Leyva، نويسنده , , Ra?l Pérez، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2011
  • Pages
    19
  • From page
    4780
  • To page
    4798
  • Abstract
    Although there are several proposals in the instance selection field, none of them consistently outperforms the others over a wide range of domains. In recent years many authors have come to the conclusion that data must be characterized in order to apply the most suitable selection criterion in each case. In light of this hypothesis, herein we propose a set of measures to characterize databases. These measures were used in decision rules which, given their values for a database, select from some pre-selected methods, the method, or combination of methods, that is expected to produce the best results. The rules were extracted based on an empirical analysis of the behaviors of several methods on several data sets, then integrated into an algorithm which was experimentally evaluated over 20 databases and with six different learning paradigms. The results were compared with those of five well-known state-of-the-art methods.
  • Keywords
    Prototype selection , Instance selection , data reduction , Machine Learning
  • Journal title
    Information Sciences
  • Serial Year
    2011
  • Journal title
    Information Sciences
  • Record number

    1214704