DocumentCode :
2228552
Title :
Instance Selection by using Polar Grids
Author :
Sang, Yongsheng ; Yi, Zhang
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Volume :
3
fYear :
2010
fDate :
20-22 Aug. 2010
Abstract :
Instance selection is about algorithms that search for a representative portion of data that can fulfill a data mining task as if the whole data is used. It is a very important data reduction technique, which can spare much memory and running time for data mining algorithms. This paper proposes a new method for Instance Selection by using Polar Grids (ISPG). The main idea is to search for a subset of instances located close to decision boundary by using a method based on polar grids. Original training instances are mapped into Polar reference frame, and the data space is partitioned as a set of polar grids. Then a special search algorithm is designed for determining which instances locate close to decision boundary. The method can also handle noisy instances and smooth data boundaries. The classical k-Nearest Neighbors (kNN) classification algorithm is employed to test the proposed method. Experiments show that the proposed method can reduce datasets effectively and achieve reasonable generalization accuracy. Moreover, the method achieves prominent learning speed, which can be used to process large spatial datasets.
Keywords :
data mining; data reduction; pattern clustering; search problems; spatial data structures; data boundaries; data mining; data reduction technique; data representative portion; data space; decision boundary; generalization accuracy; instance selection; k-Nearest Neighbors classification algorithm; large spatial datasets; learning speed; noisy instances; polar grids; polar reference frame; running time; search algorithm; training instances; Accuracy; Noise measurement; Spirals;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
Conference_Location :
Chengdu
ISSN :
2154-7491
Print_ISBN :
978-1-4244-6539-2
Type :
conf
DOI :
10.1109/ICACTE.2010.5579549
Filename :
5579549
Link To Document :
بازگشت