DocumentCode
3751516
Title
Twitter Data Mining for Events Classification and Analysis
Author
Nausheen Azam; Jahiruddin; Muhammad Abulaish; Nur Al-Hasan Haldar
Author_Institution
Sch. of IT, Centre for Dev. of Adv. Comput., Noida, India
fYear
2015
Firstpage
79
Lastpage
83
Abstract
The increasing popularity of micro-blogging sites like Twitter, which let users exchange short messages (aka tweets), is an impetus for data analytics tasks for varied purposes, ranging from business intelligence to national security. Twitter is used by a large number of users for event updates and sentiment expression. Since tweets are generally unstructured and do not follow grammatical structures, parsing techniques generally do not work well due to incorrect part-of-speech assignment to individual words. In this paper, we propose an n-gram based statistical approach to identify significant terms and use them for vector-space modelling of the tweets. Thereafter, a social graph generation method is proposed, considering tweets as nodes and the degree of similarity between a pair of tweets as a weighted edge between them. The social graph is decomposed into clusters using the Markov Clustering technique, wherein each cluster corresponds to a particular event. The experiment is carried out using a corpus of 3100 tweets related to the Israel-Gaza conflicts, the Delhi assembly election, and Union Budget 2015. The experimental results are encouraging, showing the efficacy of the proposed social graph generation and event classification methods.
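The pipeline outlined in the abstract (n-gram terms, vector-space model, weighted similarity graph, Markov Clustering) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the term-significance statistic, the similarity measure, or the clustering parameters, so raw unigram and bigram counts, cosine similarity, a similarity threshold of 0.1, an inflation value of 2.0, and the toy tweets are all assumptions made for illustration, as are the function names.

```python
import math
from collections import Counter

def ngrams(text, n_max=2):
    # Assumption: unigrams + bigrams stand in for the paper's "significant terms".
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n])
            for n in range(1, n_max + 1)
            for i in range(len(tokens) - n + 1)]

def cosine(a, b):
    # Cosine similarity between two sparse count vectors (assumed measure).
    dot = sum(v * b[t] for t, v in a.items() if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def similarity_graph(tweets, threshold=0.1):
    # Tweets are nodes; edge weight is the pairwise similarity.
    vecs = [Counter(ngrams(t)) for t in tweets]
    n = len(tweets)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        w[i][i] = 1.0  # self-loops, customary in Markov Clustering
        for j in range(i + 1, n):
            s = cosine(vecs[i], vecs[j])
            if s >= threshold:  # hypothetical cut-off, not from the paper
                w[i][j] = w[j][i] = s
    return w

def normalize_columns(m):
    # Make every column sum to 1 (column-stochastic matrix).
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def markov_cluster(w, inflation=2.0, iterations=50):
    m = normalize_columns(w)
    for _ in range(iterations):
        m = matmul(m, m)  # expansion: two-step random walk
        m = normalize_columns([[x ** inflation for x in row] for row in m])  # inflation
    clusters = []
    for row in m:
        # Each non-empty row lists the nodes attracted to that row's node.
        members = sorted(j for j, x in enumerate(row) if x > 1e-6)
        if members and members not in clusters:
            clusters.append(members)
    return clusters

# Made-up toy tweets on the three topics named in the abstract.
tweets = [
    "israel gaza conflict ceasefire talks begin",
    "israel gaza conflict ceasefire talks collapse",
    "delhi assembly election results today",
    "delhi assembly election results declared",
    "union budget 2015 tax cut announced",
    "union budget 2015 tax cut debate",
]
clusters = markov_cluster(similarity_graph(tweets))
print(clusters)
```

On these toy inputs, tweets about the same topic share most of their n-grams and none with the other topics, so each pair ends up in its own cluster. On real tweets the threshold and inflation values would need tuning, and a proper term-weighting step would replace the raw counts.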
Keywords
"Twitter","Matrix decomposition","Data mining","Nominations and elections","Feature extraction","Markov processes"
Publisher
IEEE
Conference_Titel
Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
Type
conf
DOI
10.1109/ISCMI.2015.33
Filename
7414678
Link To Document