• DocumentCode
    2053673
  • Title

    Use of neighborhood and stratification approaches to speed up instance selection algorithm

  • Author

    Ros, Frédéric ; Harba, Rachid ; Piclin, Nadege ; Pintore, Marco

  • Author_Institution
    Inst. Prisme, Orleans Univ., Orleans, France
  • fYear
    2010
  • fDate
    7-10 Dec. 2010
  • Firstpage
    320
  • Lastpage
    325
  • Abstract
    This paper investigates a method for instance selection in the context of supervised classification adapted to large databases. Based on the scale up concept, the method reduces the time required to perform the selection procedure by enabling the application of known condensation instance techniques to only small data sets instead of the whole set. The novelty of our approach relies in the way of hybridizing neighborhood and stratification approaches. The key idea is to consider instances found out for a given strata to generate sub populations for the other strata representing critical regions of the feature space. Experiments performed with various data sets revealed the effectiveness and applicability of the proposed approach.
  • Keywords
    data mining; learning (artificial intelligence); pattern classification; set theory; condensation instance techniques; feature space; instance selection algorithm; neighborhood approaches; stratification approaches; supervised classification; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Nearest neighbor searches; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-7897-2
  • Type

    conf

  • DOI
    10.1109/SOCPAR.2010.5686648
  • Filename
    5686648