DocumentCode :
168792
Title :
A Scalable System for Community Discovery in Twitter During Hurricane Sandy
Author :
Yin Huang ; Han Dong ; Yesha, Yelena ; Shujia Zhou
Author_Institution :
Comput. Sci. & Electr. Eng. Dept., Univ. of Maryland, Baltimore, MD, USA
fYear :
2014
fDate :
26-29 May 2014
Firstpage :
893
Lastpage :
899
Abstract :
The wide use of micro bloggers such as Twitter offers a valuable and reliable source of information during natural disasters. The big volume of Twitter data calls for a scalable data management system whereas the semi-structured data analysis requires full-text searching function. As a result, it becomes challenging yet essential for disaster response agencies to take full advantage of social media data for decision making in a near-real-time fashion. In this work, we use Lucene to empower HBase with full-text searching ability to build a scalable social media data analytics system for observing and analyzing human behaviors during the Hurricane Sandy disaster. Experiments show the scalability and efficiency of the system. Furthermore, the discovery of communities has the benefit of identifying influential users and tracking the topical changes as the disaster unfolds. We develop a novel approach to discover communities in Twitter by applying spectral clustering algorithm to retweet graph. The topics and influential users of each community are also analyzed and demonstrated using Latent Semantic Indexing (LSI).
Keywords :
behavioural sciences computing; data analysis; disasters; social networking (online); storms; HBase; Hurricane Sandy disaster; LSI; Lucene; Twitter; community discovery; full-text searching ability; human behavior analysis; latent semantic indexing; natural disasters; scalable social media data analytics system; scalable system; spectral clustering algorithm; Clustering algorithms; Communities; Distributed databases; Hurricanes; Indexing; Twitter; Hadoop; Hbase; Hurricane Sandy; Lucene; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/CCGrid.2014.122
Filename :
6846543
Link To Document :
بازگشت