• DocumentCode
    712887
  • Title

    Document clustering using gravitational ensemble clustering

  • Author

    Sadeghian, Armindokht Hashempour ; Nezamabadi-pour, Hossein

  • Author_Institution
    Dept. of Electr. Eng., Shahid Bahonar Univ. of Kerman, Kerman, Iran
  • fYear
    2015
  • fDate
    3-5 March 2015
  • Firstpage
    240
  • Lastpage
    245
  • Abstract
    Text Mining is a field that is considered as an extension of data mining. In the context of text mining, document clustering is used to set apart likewise documents of a collection into the identical category, called cluster, and divergent documents to distinctive groups. Since every dataset has its own characteristics, finding an appropriate clustering algorithm that can manage all kinds of clusters, is a big challenge. Clustering algorithms has theirs unique approaches for computing the number of clusters, imposing a structure on the data, and attesting the out coming clusters. The idea of combining different clustering is an effort to overwhelm the faults of single algorithms and further enhance their executions. On the other hand, inspired by the gravitational law, different clustering algorithms have been introduced that each one attempted to cluster complex datasets. Gravitational Ensemble Clustering (GEC) is an ensemble method that employs both the concepts of gravitational clustering and ensemble clustering to reach a better clustering result. This paper represents an application of GEC to the problem of document clustering. The proposed method uses a modification of the original GEC algorithm. This modification tries to produce a more varied clustering ensemble using new parameter setting. The GEC algorithm is assessed using document datasets. Promising results of the presented method were obtained in comparison with competing algorithms.
  • Keywords
    data mining; pattern clustering; text analysis; data mining; document clustering; gravitational ensemble clustering; gravitational law; text mining; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data mining; Entropy; Partitioning algorithms; Data Clustering; Data mining; Document clustering; Gravitational clustering; Quality measures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Signal Processing (AISP), 2015 International Symposium on
  • Conference_Location
    Mashhad
  • Print_ISBN
    978-1-4799-8817-4
  • Type

    conf

  • DOI
    10.1109/AISP.2015.7123481
  • Filename
    7123481