• DocumentCode
    168792
  • Title

    A Scalable System for Community Discovery in Twitter During Hurricane Sandy

  • Author

    Yin Huang ; Han Dong ; Yesha, Yelena ; Shujia Zhou

  • Author_Institution
    Comput. Sci. & Electr. Eng. Dept., Univ. of Maryland, Baltimore, MD, USA
  • fYear
    2014
  • fDate
    26-29 May 2014
  • Firstpage
    893
  • Lastpage
    899
  • Abstract
    The wide use of micro bloggers such as Twitter offers a valuable and reliable source of information during natural disasters. The big volume of Twitter data calls for a scalable data management system whereas the semi-structured data analysis requires full-text searching function. As a result, it becomes challenging yet essential for disaster response agencies to take full advantage of social media data for decision making in a near-real-time fashion. In this work, we use Lucene to empower HBase with full-text searching ability to build a scalable social media data analytics system for observing and analyzing human behaviors during the Hurricane Sandy disaster. Experiments show the scalability and efficiency of the system. Furthermore, the discovery of communities has the benefit of identifying influential users and tracking the topical changes as the disaster unfolds. We develop a novel approach to discover communities in Twitter by applying spectral clustering algorithm to retweet graph. The topics and influential users of each community are also analyzed and demonstrated using Latent Semantic Indexing (LSI).
  • Keywords
    behavioural sciences computing; data analysis; disasters; social networking (online); storms; HBase; Hurricane Sandy disaster; LSI; Lucene; Twitter; community discovery; full-text searching ability; human behavior analysis; latent semantic indexing; natural disasters; scalable social media data analytics system; scalable system; spectral clustering algorithm; Clustering algorithms; Communities; Distributed databases; Hurricanes; Indexing; Twitter; Hadoop; Hbase; Hurricane Sandy; Lucene; Twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/CCGrid.2014.122
  • Filename
    6846543