• DocumentCode
    3751516
  • Title

    Twitter Data Mining for Events Classification and Analysis

  • Author

    Nausheen Azam; Jahiruddin;Muhammad Abulaish;Nur Al-Hasan Haldar

  • Author_Institution
    Sch. of IT, Centre for Dev. of Adv. Comput., Noida, India
  • fYear
    2015
  • Firstpage
    79
  • Lastpage
    83
  • Abstract
    The increasing popularity of the micro-blogging sites like Twitter, which facilitates users to exchange short messages (aka tweets) is an impetus for data analytics tasks for varied purposes, ranging from business intelligence to nation security. Twitter is being used by a large number of users for events update and sentiment expression. Since tweets are generally unstructured in nature and do not follow grammatical structures, parsing techniques generally do not work well due to incorrect parts-of-speech assignment to individual words. In this paper, we have proposed an n-gram based statistical approach to identify significant terms and using them for vector-space modelling of the tweets. Thereafter, a social graph generation method is proposed, considering tweets as nodes and the degree of similarity between a pair of tweets as a weighted edge between them. The social graph is decomposed into various clusters using Markov Clustering technique, wherein each cluster corresponds to a particular event. The experiment is carried out using a corpus of 3100 tweets related to Israel-Gaza conflicts, Delhi assembly election, and union budget 2015. The experimental results are encouraging, showing the efficacy of the proposed social graph generation and event classification methods.
  • Keywords
    "Twitter","Matrix decomposition","Data mining","Nominations and elections","Feature extraction","Markov processes"
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
  • Type

    conf

  • DOI
    10.1109/ISCMI.2015.33
  • Filename
    7414678