DocumentCode
2053673
Title
Use of neighborhood and stratification approaches to speed up instance selection algorithm
Author
Ros, Frédéric ; Harba, Rachid ; Piclin, Nadege ; Pintore, Marco
Author_Institution
Inst. Prisme, Orleans Univ., Orleans, France
fYear
2010
fDate
7-10 Dec. 2010
Firstpage
320
Lastpage
325
Abstract
This paper investigates a method for instance selection in the context of supervised classification adapted to large databases. Based on the scale up concept, the method reduces the time required to perform the selection procedure by enabling the application of known condensation instance techniques to only small data sets instead of the whole set. The novelty of our approach relies in the way of hybridizing neighborhood and stratification approaches. The key idea is to consider instances found out for a given strata to generate sub populations for the other strata representing critical regions of the feature space. Experiments performed with various data sets revealed the effectiveness and applicability of the proposed approach.
Keywords
data mining; learning (artificial intelligence); pattern classification; set theory; condensation instance techniques; feature space; instance selection algorithm; neighborhood approaches; stratification approaches; supervised classification; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Nearest neighbor searches; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location
Paris
Print_ISBN
978-1-4244-7897-2
Type
conf
DOI
10.1109/SOCPAR.2010.5686648
Filename
5686648
Link To Document