DocumentCode
3440583
Title
Improved Text Clustering Algorithm and Application in Microblogging Public Opinion Analysis
Author
Yiyang Wang ; Li Wang ; Jing Qi ; Zhong Qian ; Bo Xu ; Chao Lei ; Yuexiang Yang ; Huali Cai
Author_Institution
Sch. of Econ. & Manage., Beijing Univ. of Aeronaut. & Astronaut., Beijing, China
fYear
2013
fDate
3-4 Dec. 2013
Firstpage
27
Lastpage
31
Abstract
Based on K-Means algorithm and agglomerative hierarchical clustering algorithm, improvement was made regarding the use of clustering algorithm in the application of text mining. It was verified that the accuracy and efficiency of hot topic detection had been enhanced via vector representation of the text, text similarity calculation, and implementation of clustering algorithm.
Keywords
Web sites; data mining; pattern clustering; text analysis; vectors; K-means algorithm; agglomerative hierarchical clustering algorithm; hot topic detection; microblogging public opinion analysis; text clustering algorithm; text mining; text similarity calculation; vector representation; Accuracy; Algorithm design and analysis; Clustering algorithms; Educational institutions; Optimization; Partitioning algorithms; Vectors; K-Means algorithm; agglomerative hierarchical clustering; hot topic detection; text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering (WCSE), 2013 Fourth World Congress on
Conference_Location
Hong Kong
Print_ISBN
978-1-4799-2882-8
Type
conf
DOI
10.1109/WCSE.2013.9
Filename
6754259
Link To Document