Title :
Cluster validation in k-Means clustering of mixed databases based on principal component analysis
Author :
Nonoguchi, R. ; Honda, Kazuhiro ; Notsu, A. ; Ichihashi, Hayato
Author_Institution :
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
Abstract :
Considering the close relation between k-Means clustering and principal component analysis (PCA), a cluster validation approach for k-Means partitions was proposed using analytical solutions of PCA. In this paper, the validation approach is further extended for handling mixed databases composed of not only numerical observations but also categorical observations. In the new validation approach for k-Means clustering of mixed databases, PCA solutions are given by considering optimal scaling of category observations, and the plausibility of k-Means solutions are evaluated by calculating deviations from the PCA solutions after Procrustean rotation.
Keywords :
database management systems; learning (artificial intelligence); pattern clustering; principal component analysis; PCA solution; categorical observation; cluster validation approach; k-means clustering; mixed database clustering; numerical observation; principal component analysis; procrustean rotation;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
DOI :
10.1109/SCIS-ISIS.2012.6505102