Title :
Improved Particle Swarm Optimization Based K-Means Clustering
Author :
Prabha, K. Arun ; Visalakshi, N. Karthikeyani
Author_Institution :
Dept. of Comput. Technol., Vellalar Coll. for Women, Erode, India
Abstract :
Clustering is a popular data analysis and data mining technique. K-Means is one of the most popular data mining algorithms for being simple, scalable and easily modifiable to a variety of contexts and application domains. The major issue of traditional K-Means algorithm is that its performance depends on the initialization of centroid and requires the number of clusters to be specified in advance. Many evolutionary based clustering algorithms have been developed in recent years for selecting optimum initial centroid to optimize clustering results. Particle Swarm Optimization algorithm is a population-based memetic-evolution-motivated meta-heuristic algorithm that mimics the capability of swarm. The K-Means algorithm typically uses Euclidean or squared Euclidean distance to measure the distortion between a data object and its cluster centroid. The Euclidean and squared Euclidean distances are usually computed from raw data and not from standardized data. Normalization is one of the important preprocessing steps, to transform values of all attributes. Effective data clustering can only occur if an equally effective technique for normalizing the data is applied. This paper proposes an effective partitional clustering algorithm which is developed by integrating the merits of Particle Swarm Optimization and normalization with traditional K-Means clustering algorithms. Experiments are conducted on real dataset to prove the efficiency of the proposed algorithm.
Keywords :
data mining; evolutionary computation; particle swarm optimisation; pattern clustering; data analysis; data clustering; data mining; evolutionary based clustering algorithm; k-means clustering; metaheuristic algorithm; particle swarm optimization; partitional clustering algorithm; population-based memetic-evolution; squared Euclidean distance; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Indexes; Particle swarm optimization; Partitioning algorithms; Centroid; Evolutionary Algorithm; K-Means; Normalization; Particle Swarm Optimization;
Conference_Titel :
Intelligent Computing Applications (ICICA), 2014 International Conference on
Conference_Location :
Coimbatore
DOI :
10.1109/ICICA.2014.21