DocumentCode
3418581
Title
Text Clustering via Particle Swarm Optimization
Author
Lu, Yanping ; Wang, Shengrui ; Li, Shaozi ; Zhou, Changle
Author_Institution
Dept. of Comput., Univ. of Sherbrooke, Sherbrooke, QC
fYear
2009
fDate
March 30 2009-April 2 2009
Firstpage
45
Lastpage
51
Abstract
This paper presents text clustering via particle swarm optimization (TCPSO), an approach that extends a particle swarm optimizer for variable weighting (PSOVW) to the problem of text clustering. PSOVW has been used to evolve optimal feature weights for clusters and has been shown to improve the clustering quality of high-dimensional data. Applying it to text clustering, however, requires modifications to the similarity measure, the parameter selection and the criterion function. Our experimental results on four structured text datasets built from 20 Newsgroups and on four large-scale text datasets selected from CLUTO show that the proposed algorithm greatly improves the quality of text clustering compared to four typical clustering algorithms and one competitive subspace clustering method.
Keywords
data structures; particle swarm optimisation; pattern clustering; high-dimensional data; parameter selection; particle swarm optimization; structured text datasets; subspace clustering method; text clustering; variable weighting; Circuits; Clustering algorithms; Clustering methods; Frequency; Large-scale systems; Merging; Optimization methods; Particle swarm optimization; Partitioning algorithms; Text mining
fLanguage
English
Publisher
IEEE
Conference_Titel
Swarm Intelligence Symposium, 2009. SIS '09. IEEE
Conference_Location
Nashville, TN
Print_ISBN
978-1-4244-2762-8
Type
conf
DOI
10.1109/SIS.2009.4937843
Filename
4937843