• DocumentCode
    3418581
  • Title

    Text Clustering via Particle Swarm Optimization

  • Author

    Lu, Yanping ; Wang, Shengrui ; Li, Shaozi ; Zhou, Changle

  • Author_Institution
    Dept. of Comput., Univ. of Sherbrooke, Sherbrooke, QC
  • fYear
    2009
  • fDate
    March 30 2009-April 2 2009
  • Firstpage
    45
  • Lastpage
    51
  • Abstract
    This paper presents an approach which extends a particle swarm optimizer for variable weighting (PSOVW) to handle the problem of text clustering, called text clustering via particle swarm optimization (TCPSO). PSOVW has been exploited for evolving optimal feature weights for clusters and has demonstrated to improve the clustering quality of high-dimensional data. However, when applying it for text clustering, there exist some modifications such as the similarity measure, parameter selection and the criterion function. Our experimental results on both four structured text datasets built from 20 newsgroups as well as four large-scale text datasets selected from CLUTO show that the proposed algorithm is able to greatly improve the quality of text clustering compared to four typical clustering algorithms and one competitive subspace clustering method.
  • Keywords
    data structures; particle swarm optimisation; pattern clustering; high-dimensional data; parameter selection; particle swarm optimization; structured text datasets; subspace clustering method; text clustering; variable weighting; Circuits; Clustering algorithms; Clustering methods; Frequency; Large-scale systems; Merging; Optimization methods; Particle swarm optimization; Partitioning algorithms; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Swarm Intelligence Symposium, 2009. SIS '09. IEEE
  • Conference_Location
    Nashville, TN
  • Print_ISBN
    978-1-4244-2762-8
  • Type

    conf

  • DOI
    10.1109/SIS.2009.4937843
  • Filename
    4937843