DocumentCode :
2002951
Title :
Cluster validation in k-Means clustering of mixed databases based on principal component analysis
Author :
Nonoguchi, R. ; Honda, Kazuhiro ; Notsu, A. ; Ichihashi, Hayato
Author_Institution :
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
fYear :
2012
fDate :
20-24 Nov. 2012
Firstpage :
1784
Lastpage :
1789
Abstract :
Considering the close relation between k-Means clustering and principal component analysis (PCA), a cluster validation approach for k-Means partitions was proposed using analytical solutions of PCA. In this paper, the validation approach is further extended for handling mixed databases composed of not only numerical observations but also categorical observations. In the new validation approach for k-Means clustering of mixed databases, PCA solutions are given by considering optimal scaling of category observations, and the plausibility of k-Means solutions are evaluated by calculating deviations from the PCA solutions after Procrustean rotation.
Keywords :
database management systems; learning (artificial intelligence); pattern clustering; principal component analysis; PCA solution; categorical observation; cluster validation approach; k-means clustering; mixed database clustering; numerical observation; principal component analysis; procrustean rotation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
Type :
conf
DOI :
10.1109/SCIS-ISIS.2012.6505102
Filename :
6505102
Link To Document :
بازگشت