• DocumentCode
    2228552
  • Title

    Instance Selection by using Polar Grids

  • Author

    Sang, Yongsheng ; Yi, Zhang

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    3
  • fYear
    2010
  • fDate
    20-22 Aug. 2010
  • Abstract
    Instance selection is about algorithms that search for a representative portion of data that can fulfill a data mining task as if the whole data is used. It is a very important data reduction technique, which can spare much memory and running time for data mining algorithms. This paper proposes a new method for Instance Selection by using Polar Grids (ISPG). The main idea is to search for a subset of instances located close to decision boundary by using a method based on polar grids. Original training instances are mapped into Polar reference frame, and the data space is partitioned as a set of polar grids. Then a special search algorithm is designed for determining which instances locate close to decision boundary. The method can also handle noisy instances and smooth data boundaries. The classical k-Nearest Neighbors (kNN) classification algorithm is employed to test the proposed method. Experiments show that the proposed method can reduce datasets effectively and achieve reasonable generalization accuracy. Moreover, the method achieves prominent learning speed, which can be used to process large spatial datasets.
  • Keywords
    data mining; data reduction; pattern clustering; search problems; spatial data structures; data boundaries; data mining; data reduction technique; data representative portion; data space; decision boundary; generalization accuracy; instance selection; k-Nearest Neighbors classification algorithm; large spatial datasets; learning speed; noisy instances; polar grids; polar reference frame; running time; search algorithm; training instances; Accuracy; Noise measurement; Spirals;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2154-7491
  • Print_ISBN
    978-1-4244-6539-2
  • Type

    conf

  • DOI
    10.1109/ICACTE.2010.5579549
  • Filename
    5579549