DocumentCode :
3059530
Title :
New distance measure based on the domain for categorical data
Author :
Aranganayagi, S. ; Thangavel, K. ; Sujatha, S.
Author_Institution :
J.K.K. Nataraja Coll. of Arts & Sci., Komarapalayam, India
fYear :
2009
fDate :
13-15 Dec. 2009
Firstpage :
93
Lastpage :
96
Abstract :
Clustering the process of grouping homogeneous objects is an important data mining process. Few algorithms exist to cluster categorical data. K-modes is the scalable and efficient algorithm to cluster the categorical data. In this paper we propose a new distance measure for K-modes based on the cardinality of domain of attribute. The proposed method is experimented with data sets obtained from UCI data repository. Results prove that the proposed measure generates better clusters than the K-modes algorithm.
Keywords :
data mining; pattern clustering; K-modes algorithm; UCI data repository; categorical data; cluster categorical data; data clustering process; data mining process; distance measure; Clustering algorithms; Clustering methods; Data engineering; Data mining; Databases; Educational institutions; Fasteners; Frequency measurement; Histograms; Weight measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing, 2009. ICAC 2009. First International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4244-4786-2
Electronic_ISBN :
978-1-4244-4787-9
Type :
conf
DOI :
10.1109/ICADVC.2009.5378267
Filename :
5378267
Link To Document :
بازگشت