Title :
An Extended Fuzzy k-Means Algorithm for Clustering Categorical Valued Data
Author :
Jiacai, Wang ; Ruijun, Gu
Author_Institution :
Sch. of Inf. Sci., Nanjing Audit Univ., Nanjing, China
Abstract :
Although fuzzy k-modes algorithm has removed the numeric-only limitation of the k-means algorithm, that each attribute of the centroid with a single category value and the use of a simple distance measure will compromise its precision, and therefore prone to falling into local optima. In this paper, an extended fuzzy k-means(xFKM) algorithm for clustering categorical valued data is presented, in which the cluster centroid vectors are represented as expanded forms to keep more clustering information as possible as well, and updated with the method similar to fuzzy k-means algorithm. Experiments on several real databases show that xFKM algorithm can get better clustering result than fuzzy k-modes algorithm does.
Keywords :
data handling; fuzzy set theory; pattern clustering; categorical valued data; cluster centroid vector; clustering information; extended fuzzy k means; local optima; Accuracy; Algorithm design and analysis; Clustering algorithms; Computational efficiency; Cost function; Databases; Partitioning algorithms; categorical data; fuzzy k-modes algorithm; fuzzy partitional clustering;
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
DOI :
10.1109/AICI.2010.225