DocumentCode
188550
Title
An Entropy-Based Subspace Clustering Algorithm for Categorical Data
Author
Carbonera, Joel Luis ; Abel, Mara
Author_Institution
Inst. of Inf., Univ. Fed. do Rio Grande do Sul - UFRGS, Porto Alegre, Brazil
fYear
2014
fDate
10-12 Nov. 2014
Firstpage
272
Lastpage
277
Abstract
The interest in attribute weighting for soft subspace clustering have been increasing in the last years. However, most of the proposed approaches are designed for dealing only with numeric data. In this paper, our focus is on soft subspace clustering for categorical data. In soft subspace clustering, the attribute weighting approach plays a crucial role. Due to this, we propose an entropy-based approach for measuring the relevance of each categorical attribute in each cluster. Besides that, we propose the EBK-modes (entropy-based k-modes), an extension of the basic k-modes that uses our approach for attribute weighting. We performed experiments on five real-world datasets, comparing the performance of our algorithms with four state-of-the-art algorithms, using three well-known evaluation metrics: accuracy, f-measure and adjusted Rand index. According to the experiments, the EBK-modes outperforms the algorithms that were considered in the evaluation, regarding the considered metrics.
Keywords
entropy; pattern clustering; EBK-modes; adjusted Rand index; attribute weighting approach; basic k-modes; categorical data; entropy-based subspace clustering algorithm; evaluation metrics; f-measure; soft subspace clustering; Accuracy; Breast cancer; Clustering algorithms; Entropy; Indexes; Partitioning algorithms; Uncertainty; attribute weighting; categorical data; clustering; data mining; entropy; subspace clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2014 IEEE 26th International Conference on
Conference_Location
Limassol
ISSN
1082-3409
Type
conf
DOI
10.1109/ICTAI.2014.48
Filename
6984484
Link To Document