Title :
A study of the scaling up capabilities of stratified prototype generation
Author :
Triguero, I. ; Derrac, J. ; Herrera, F. ; García, S.
Author_Institution :
Dept. of Comput. Sci. & Artificial Intell., Univ. of Granada, Granada, Spain
Abstract :
Prototype generation is an appropriate data reduction process for improving the efficiency and the efficacy of the nearest neighbor rule. Specifically, evolutionary prototype generation techniques have been highlighted as the best performing methods. However, these methods can sometimes be inefficient when the data scale up. In other data reduction techniques, such as prototype selection, an stratification procedure has been successfully developed to deal with large data sets. In this study, we test the combination of stratification with prototype generation techniques, considering data sets with more than 10000 instances. We compare some of the most representative prototype reduction methods and perform a study of the effects of stratification in their behavior. The results, contrasted with nonparametric statistical tests, show that several prototype generation techniques present a better performance than previously analyzed methods.
Keywords :
data reduction; evolutionary computation; statistical analysis; data reduction process; evolutionary prototype generation techniques; nearest neighbor rule; nonparametric statistical tests; stratified prototype generation; Accuracy; Biology; Complexity theory; Data mining; Prototypes; Training; Training data; Data Reduction; Nearest Neighbor; Prototype Generation; Scaling up; Stratification;
Conference_Titel :
Nature and Biologically Inspired Computing (NaBIC), 2011 Third World Congress on
Conference_Location :
Salamanca
Print_ISBN :
978-1-4577-1122-0
DOI :
10.1109/NaBIC.2011.6089611