• DocumentCode
    2619543
  • Title

    Towards urban phenomenon sensing by automatic tagging of tweets

  • Author

    Khan, Muhammad Asif Hossain ; Iwai, Masayuki ; Sezaki, Kaoru

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2012
  • fDate
    11-14 June 2012
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Micro-blogging sites like Twitter have become a valuable source of information due to their recent upsurge in popularity. The objective of this research is to sense the urban phenomena like peoples´ interest in particular topics or shift of interest from one topic to another. Sometimes, even just the proportion of tweets related to a topic appearing in the daily Twitter corpus of a region can give a good indication about peoples´ level of interest in that topic on that particular day. Unfortunately, most of the tweets are not explicitly tagged with topic keywords by the Twitter users. In this paper we propose a method for automatic tagging of untagged tweets. Our method is based on identification of important collocations from a large training set of tweets. We then train a multinomial Naiıve Bayes classifier using these collocation features for tagging untagged tweets. We could achieve 88.25% accuracy with high precision and recall.
  • Keywords
    Bayes methods; classification; information dissemination; information filtering; social networking (online); Twitter; automatic tagging; collocation features; microblogging site; multinomial Naiıve Bayes classifier; tweets; untagged tweet; urban phenomenon sensing; Accuracy; Marketing and sales; Noise measurement; Tagging; Training; Training data; Twitter; Collective intelligence; Short text classification; Trend analysis; Urban sensing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networked Sensing Systems (INSS), 2012 Ninth International Conference on
  • Conference_Location
    Antwerp
  • Print_ISBN
    978-1-4673-1784-9
  • Electronic_ISBN
    978-1-4673-1785-6
  • Type

    conf

  • DOI
    10.1109/INSS.2012.6240529
  • Filename
    6240529