• DocumentCode
    602531
  • Title

    Identification of non-disjoint clusters with small and parameterizable overlaps

  • Author

    Ben N´Cir, Chiheb-Eddine ; Cleuziou, G. ; Essoussi, Nadia

  • fYear
    2013
  • fDate
    20-22 Jan. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Identification of non-disjoint groups in unlabeled data sets is an important issue in clustering. Many real life applications require to find overlapping clusters in order to fit the data set structures such as clustering of films where each film can have different genres. This paper presents an overlapping k-means method refereed as Restricted-OKM (Restricted Overlapping k-means) that generalizes the well known k-means algorithm to detect overlapping clusters. The proposed method produces restricted overlapping boundaries between clusters and improves clustering accuracy to make the method adapted for clustering data with small overlaps. The proposed method is extended to control sizes of overlaps between clusters with respect to user expectations. Experiments, performed on overlapping data sets, show that proposed methods outperform OKM (Overlapping k-means) and fuzzy c-means in terms of clustering accuracy and produce clusters with small overlapping boundaries.
  • Keywords
    fuzzy set theory; identification; pattern clustering; data clustering; data set structure; film clustering; fuzzy c-means; identification; k-means algorithm; nondisjoint cluster; nondisjoint group; overlapping cluster; overlapping k-means method; parameterizable overlap; restricted overlapping boundary; restricted overlapping k-means; restricted-OKM; unlabeled data set; user expectation; Clustering algorithms; Clustering methods; Equations; Linear programming; Minimization; Prototypes; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Applications Technology (ICCAT), 2013 International Conference on
  • Conference_Location
    Sousse
  • Print_ISBN
    978-1-4673-5284-0
  • Type

    conf

  • DOI
    10.1109/ICCAT.2013.6522010
  • Filename
    6522010