DocumentCode
602531
Title
Identification of non-disjoint clusters with small and parameterizable overlaps
Author
Ben N´Cir, Chiheb-Eddine ; Cleuziou, G. ; Essoussi, Nadia
fYear
2013
fDate
20-22 Jan. 2013
Firstpage
1
Lastpage
6
Abstract
Identification of non-disjoint groups in unlabeled data sets is an important issue in clustering. Many real life applications require to find overlapping clusters in order to fit the data set structures such as clustering of films where each film can have different genres. This paper presents an overlapping k-means method refereed as Restricted-OKM (Restricted Overlapping k-means) that generalizes the well known k-means algorithm to detect overlapping clusters. The proposed method produces restricted overlapping boundaries between clusters and improves clustering accuracy to make the method adapted for clustering data with small overlaps. The proposed method is extended to control sizes of overlaps between clusters with respect to user expectations. Experiments, performed on overlapping data sets, show that proposed methods outperform OKM (Overlapping k-means) and fuzzy c-means in terms of clustering accuracy and produce clusters with small overlapping boundaries.
Keywords
fuzzy set theory; identification; pattern clustering; data clustering; data set structure; film clustering; fuzzy c-means; identification; k-means algorithm; nondisjoint cluster; nondisjoint group; overlapping cluster; overlapping k-means method; parameterizable overlap; restricted overlapping boundary; restricted overlapping k-means; restricted-OKM; unlabeled data set; user expectation; Clustering algorithms; Clustering methods; Equations; Linear programming; Minimization; Prototypes; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Applications Technology (ICCAT), 2013 International Conference on
Conference_Location
Sousse
Print_ISBN
978-1-4673-5284-0
Type
conf
DOI
10.1109/ICCAT.2013.6522010
Filename
6522010
Link To Document