DocumentCode
2448274
Title
Fast instance selection hybrid algorithm adapted to large data sets
Author
Ros, Frédéric ; Harba, Rachid
Author_Institution
Inst. Prisme, Orleans Univ., Orleans, France
fYear
2011
fDate
14-16 Oct. 2011
Firstpage
117
Lastpage
122
Abstract
This paper investigates a new hybrid algorithm for instance selection adapted to large databases. The key idea is to apply condensation algorithms to only small sets and useful patterns to reduce computation cost. The initial population is divided into “meta strata” resulting from the union of strata randomly generated. Interesting patterns are resulting from a reference “meta stratum” and are partitioned in clusters. For each “meta stratum” and each cluster, influencing patterns are selected on the basis of a 1-nn procedure. The sets of instances determined from all “meta strata” provide the final set. Experiments performed with various data sets are revealing the effectiveness and adequacy of the proposed approach.
Keywords
data mining; database management systems; pattern clustering; 1-nn procedure; computation cost reduction; condensation algorithm; instance selection hybrid algorithm; large database; meta strata; meta stratum; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
Conference_Location
Dalian
Print_ISBN
978-1-4577-1195-4
Type
conf
DOI
10.1109/SoCPaR.2011.6089125
Filename
6089125
Link To Document