Title :
Feature and instances selection for nearest neighbor classification via cooperative PSO
Author :
Ahmad, S. Sakinah S.
Author_Institution :
Fac. of Inf. & Commun. Technol., Univ. Teknikal Malaysia Melaka, Melaka, Malaysia
Abstract :
Data reduction is an essential task in the data preparation phase of knowledge discovery and data mining (KDD). The reduction method contains two techniques, namely features reduction and data reduction which are commonly applied to a classification problem. The solution of data reduction can be viewed as a search problem. Therefore, it can be solved by using population-based techniques such as Genetic Algorithm and Particle Swarm Optimization. This paper proposes the integration of feature reduction and data reduction for Nearest Neighbor (NN) classification using Cooperative Binary Particle Swarm Optimization (CBPSO). This method can overcome the limitation of using the Nearest Neighbor (NN) classifier when dealing with high dimensional and large data. The proposed method is applied to 14 real world dataset from the machine learning repository. The algorithm´s performance is illustrated by the corresponding table of the classification rate. The experimental results demonstrate the effectiveness of our proposed method.
Keywords :
data mining; feature selection; genetic algorithms; learning (artificial intelligence); particle swarm optimisation; pattern classification; search problems; CBPSO; KDD; NN classification; classification rate; cooperative PSO; cooperative binary particle swarm optimization; data preparation phase; data reduction; feature reduction; feature selection; genetic algorithm; high-dimensional large data; instance selection; knowledge discovery-and-data mining; machine learning repository; nearest neighbor classification; population-based techniques; real world dataset; search problem; Accuracy; Algorithm design and analysis; Classification algorithms; Genetic algorithms; Iris; Optimization; Training; Binary Cooperative Particle Swarm Optimization; Feature reduction; Instances reduction; Nearest Neighbor;
Conference_Titel :
Information and Communication Technologies (WICT), 2014 Fourth World Congress on
Print_ISBN :
978-1-4799-8114-4
DOI :
10.1109/WICT.2014.7077300