• DocumentCode
    1879660
  • Title

    Document clustering using particle swarm optimization

  • Author

    Cui, Xiaohui ; Potok, Thomas E. ; Palathingal, Paul

  • Author_Institution
    Computational Sci. & Eng. Div., Oak Ridge Nat. Lab., TN, USA
  • fYear
    2005
  • fDate
    8-10 June 2005
  • Firstpage
    185
  • Lastpage
    191
  • Abstract
    Fast and high-quality document clustering algorithms play an important role in effectively navigating, summarizing, and organizing information. Recent studies have shown that partitional clustering algorithms are more suitable for clustering large datasets. However, the K-means algorithm, the most commonly used partitional clustering algorithm, can only generate a local optimal solution. In this paper, we present a particle swarm optimization (PSO) document clustering algorithm. Contrary to the localized searching of the K-means algorithm, the PSO clustering algorithm performs a globalized search in the entire solution space. In the experiments we conducted, we applied the PSO, K-means and hybrid PSO clustering algorithm on four different text document datasets. The number of documents in the datasets ranges from 204 to over 800, and the number of terms ranges from over 5000 to over 7000. The results illustrate that the hybrid PSO algorithm can generate more compact clustering results than the K-means algorithm.
  • Keywords
    data mining; document handling; particle swarm optimisation; search problems; very large databases; K-means algorithm; document clustering; globalized search; particle swarm optimization; partitional clustering algorithm; Clustering algorithms; Data mining; Hybrid power systems; Laboratories; Navigation; Organizing; Particle swarm optimization; Partitioning algorithms; Software algorithms; Software engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Swarm Intelligence Symposium, 2005. SIS 2005. Proceedings 2005 IEEE
  • Print_ISBN
    0-7803-8916-6
  • Type

    conf

  • DOI
    10.1109/SIS.2005.1501621
  • Filename
    1501621