DocumentCode
3704792
Title
Towards an efficient platform for social big data analytics
Author
Jenq-Haur Wang;Kuan-Ting Chen
Author_Institution
Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei, Taiwan
fYear
2015
Firstpage
175
Lastpage
179
Abstract
With the development of social networks, people can easily share their feelings and express their opinions on interesting topics. Thus, social media mining is becoming an important research topic. However, huge volume of social Web data in various forms are rapidly generated around the world in a much faster speed than those of any other media. This can lead to the difficulties in social data analysis: noisy data and efficiency. In this paper, we propose a distributed data analytics platform for social media. First, data from various sources are collected and stored in a distributed index for efficient retrieval. Then, a distributed analytics framework is built from memory-based cluster computing based on the MapReduce paradigm. Finally, statistical analysis is performed and integrated for presentation. In the experiment, we built the platform by open source projects Hadoop and Spark, and implemented combinations of map and reduce operations. We compared the efficiency and scalability of the platform on various check-in datasets. Further evaluation is needed to verify the performance in different types of operations.
Keywords
"Media","Big data","Sparks","Social network services","Distributed databases","Data analysis","Indexes"
Publisher
ieee
Conference_Titel
Wireless and Optical Communication Conference (WOCC), 2015 24th
ISSN
2379-1268
Print_ISBN
978-1-4799-8868-6
Electronic_ISBN
2379-1276
Type
conf
DOI
10.1109/WOCC.2015.7346200
Filename
7346200
Link To Document