DocumentCode :
3761874
Title :
A study on topics identification on Twitter using clustering algorithms
Author :
Marjori N. M. Klinczak;Celso A. A. Kaestner
Author_Institution :
Mosaic Web and Graduate Program in Applied Computer Science, Federal University of Technology - Paran? Avenida Sete de Setembro 3165 80230-901 Curitiba - Paran? - Brazil
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
The identification of topics in Social Networks has become an important research task when dealing with event detection, particularly when global communities are affected. Text processing techniques and machine learning algorithms have been extensively used to solve this problem. In this paper we compare three clustering algorithms - k-means, k-medoids and NMF (Non-negative Matrix Factorization) - in order to detect topics related to textual messages obtained from Twitter. The algorithms were applied to a database composed by tweets, having as initial context hashtags that are related to the recent scandal of corruption involving FIFA (International Federation of Football Association). Obtained results suggest that the NMF presents better results, since it provides providing clusters that are easier to interpret.
Keywords :
"Clustering algorithms","Twitter","Algorithm design and analysis","Principal component analysis","Text processing","Context"
Publisher :
ieee
Conference_Titel :
Computational Intelligence (LA-CCI), 2015 Latin America Congress on
Type :
conf
DOI :
10.1109/LA-CCI.2015.7435965
Filename :
7435965
Link To Document :
بازگشت