Title :
Date clustering using Principal Component Analysis and Particle Swarm Optimization
Author :
Shi-Wei, Li ; Xiao-Dong, Qian
Author_Institution :
Dept. of Traffic & Transp., Lanzhou Jiaotong Univ., Lanzhou, China
Abstract :
By analyzing the actuality of the data mining in ecommerce environment, and considering the complexity and the curse of dimensionality about extracting the implicit and unknown knowledge brought by the massive and high-dimensional data. Based on the K-means clustering, Particle Swarm Optimization (PSO) clustering and hybrid PSO clustering algorithm, this paper presented a model which combined Principal Component Analysis (PCA) with hybrid PSO to cluster data. The interrelated data have been processed by Principal Component Analysis; the results of PCA are input data for hybrid PSO clustering algorithms. It not only reduced the dimension of input variables and the scales of clustering, but also reserved the main information of original variables and eliminated the multicollinearity between the variables. It offered an effective model method for data clustering which has characters like massive, high-dimensional and heterogeneous.
Keywords :
data mining; particle swarm optimisation; pattern clustering; principal component analysis; K-means clustering; data mining; date clustering; ecommerce environment; hybrid PSO clustering; particle swarm optimization; principal component analysis; Algorithm design and analysis; Clustering algorithms; Data mining; Equations; Mathematical model; Particle swarm optimization; Principal component analysis; curse of dimensionality; k-means clustering; particle swarm optimization; principal component analysis;
Conference_Titel :
Computer Science and Education (ICCSE), 2010 5th International Conference on
Conference_Location :
Hefei
Print_ISBN :
978-1-4244-6002-1
DOI :
10.1109/ICCSE.2010.5593568