DocumentCode :
2448274
Title :
Fast instance selection hybrid algorithm adapted to large data sets
Author :
Ros, Frédéric ; Harba, Rachid
Author_Institution :
Inst. Prisme, Orleans Univ., Orleans, France
fYear :
2011
fDate :
14-16 Oct. 2011
Firstpage :
117
Lastpage :
122
Abstract :
This paper investigates a new hybrid algorithm for instance selection adapted to large databases. The key idea is to apply condensation algorithms to only small sets and useful patterns to reduce computation cost. The initial population is divided into “meta strata” resulting from the union of strata randomly generated. Interesting patterns are resulting from a reference “meta stratum” and are partitioned in clusters. For each “meta stratum” and each cluster, influencing patterns are selected on the basis of a 1-nn procedure. The sets of instances determined from all “meta strata” provide the final set. Experiments performed with various data sets are revealing the effectiveness and adequacy of the proposed approach.
Keywords :
data mining; database management systems; pattern clustering; 1-nn procedure; computation cost reduction; condensation algorithm; instance selection hybrid algorithm; large database; meta strata; meta stratum; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
Conference_Location :
Dalian
Print_ISBN :
978-1-4577-1195-4
Type :
conf
DOI :
10.1109/SoCPaR.2011.6089125
Filename :
6089125
Link To Document :
بازگشت