DocumentCode
2228552
Title
Instance Selection by using Polar Grids
Author
Sang, Yongsheng ; Yi, Zhang
Author_Institution
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Volume
3
fYear
2010
fDate
20-22 Aug. 2010
Abstract
Instance selection is about algorithms that search for a representative portion of data that can fulfill a data mining task as if the whole data is used. It is a very important data reduction technique, which can spare much memory and running time for data mining algorithms. This paper proposes a new method for Instance Selection by using Polar Grids (ISPG). The main idea is to search for a subset of instances located close to decision boundary by using a method based on polar grids. Original training instances are mapped into Polar reference frame, and the data space is partitioned as a set of polar grids. Then a special search algorithm is designed for determining which instances locate close to decision boundary. The method can also handle noisy instances and smooth data boundaries. The classical k-Nearest Neighbors (kNN) classification algorithm is employed to test the proposed method. Experiments show that the proposed method can reduce datasets effectively and achieve reasonable generalization accuracy. Moreover, the method achieves prominent learning speed, which can be used to process large spatial datasets.
Keywords
data mining; data reduction; pattern clustering; search problems; spatial data structures; data boundaries; data mining; data reduction technique; data representative portion; data space; decision boundary; generalization accuracy; instance selection; k-Nearest Neighbors classification algorithm; large spatial datasets; learning speed; noisy instances; polar grids; polar reference frame; running time; search algorithm; training instances; Accuracy; Noise measurement; Spirals;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
Conference_Location
Chengdu
ISSN
2154-7491
Print_ISBN
978-1-4244-6539-2
Type
conf
DOI
10.1109/ICACTE.2010.5579549
Filename
5579549
Link To Document