DocumentCode
2717516
Title
Clustering massive categorical data with class association rules
Author
Berrado, Abdelaziz ; Runger, George
Author_Institution
Al Akhawayn Univ.
fYear
2008
fDate
16-18 Dec. 2008
Firstpage
223
Lastpage
227
Abstract
Clustering algorithms partition data sets into groups of objects such that the pairwise similarity between objects within the same cluster is higher than those assigned to different clusters. Defining a similarity measure becomes challenging in the presence of categorical data and affects the quality and meaningfulness of the clusters formed. Furthermore, the curse of dimensionality diminishes the robustness of such measures. This paper introduces SCAR (supervised clustering with association rules) a nontraditional algorithm for clustering massive high dimensional categorical data. SCAR is robust to the curse of dimensionality, it relies on association rules as an intuitive way to evaluate the similarity between objects and group them.
Keywords
data mining; pattern clustering; SCAR; class association rules; clustering algorithms; clustering massive categorical data; supervised clustering; Association rules; Clustering algorithms; Clustering methods; Entropy; Euclidean distance; Mutual information; Partitioning algorithms; Robustness; Supervised learning; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location
Al Ain
Print_ISBN
978-1-4244-3396-4
Electronic_ISBN
978-1-4244-3397-1
Type
conf
DOI
10.1109/INNOVATIONS.2008.4781693
Filename
4781693
Link To Document