• DocumentCode
    1840538
  • Title

    Hierarchical-Hyperspherical Divisive Fuzzy C-Means (H2D-FCM) Clustering for Information Retrieval

  • Author

    Bordogna, Gloria ; Pasi, Gabriella

  • Volume
    1
  • fYear
    2009
  • fDate
    15-18 Sept. 2009
  • Firstpage
    614
  • Lastpage
    621
  • Abstract
    In this paper an original soft hierarchical Fuzzy Clustering algorithm is proposed, named Hierarchical Hyper-spherical Divisive Fuzzy C-Means (H2D-FCM), with the following characteristics: it generates a “soft” hierarchy in which a document can belong to several child clusters of a node, and the clusters in the same hierarchical level are more specific (general) than the clusters in the upper (lower) level. The proposed algorithm is a divisive algorithm based on a modified bisective K-Means, applying a modified probabilistic Fuzzy C Means algorithm to divide each node into child-nodes. The algorithm determines the proper number of cluster to generate at the first level based on an entropy measure and decides if a node can be further split based on a “density” measure. The paper presents the algorithm and its evaluations on two standard collections.
  • Keywords
    Character generation; Clustering algorithms; Conferences; Content based retrieval; Density measurement; Entropy; Information filtering; Information retrieval; Intelligent agent; Unsupervised learning; Fuzzy C Means Algorithm; Fuzzy Hierarchical clustering; Vector Space Model.; documents clustering; unsupervised hierarchical categorization;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Milan, Italy
  • Print_ISBN
    978-0-7695-3801-3
  • Electronic_ISBN
    978-1-4244-5331-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2009.104
  • Filename
    5284910