DocumentCode :
2962028
Title :
Initialization of k-modes clustering for categorical data
Author :
Li Tao-Ying ; Chen Yan ; Jin Zhi-hong ; Li Ye
Author_Institution :
Transp. Manage. Coll., Dalian Maritime Univ., Dalian, China
fYear :
2013
fDate :
17-19 July 2013
Firstpage :
107
Lastpage :
112
Abstract :
The k-modes clustering algorithm is undoubtedly one of the most widely used partitional algorithms for categorical data. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initialization of clustering. Categorical initialization methods have been proposed to address this problem. In this paper, we present an overview of initialization methods of clustering for numerical data and categorical data respectively with an emphasis on their computational efficiency. We then propose a new initialization method for categorical data, which can obtain the good initial cluster centers using the new distance base on the RD, and explore the methods of density and grid. Finally, proposed method has been tested on diagnosis dataset, a real world data set from UCI Machine Learning Repository, and been analyzed the experimental results, which illustrates that the proposed method is effective and efficient for initializing categorical data.
Keywords :
gradient methods; pattern clustering; categorical data clustering; categorical initialization methods; cluster centers; computational efficiency; gradient descent nature; k-modes clustering algorithm; numerical data clustering; partitional algorithms; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Computational efficiency; Computational modeling; Pain; Partitioning algorithms; categorical data; density and grid measure; initialization of clustering; the k-modes clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Management Science and Engineering (ICMSE), 2013 International Conference on
Conference_Location :
Harbin
ISSN :
2155-1847
Print_ISBN :
978-1-4799-0473-0
Type :
conf
DOI :
10.1109/ICMSE.2013.6586269
Filename :
6586269
Link To Document :
بازگشت