DocumentCode :
3704792
Title :
Towards an efficient platform for social big data analytics
Author :
Jenq-Haur Wang;Kuan-Ting Chen
Author_Institution :
Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei, Taiwan
fYear :
2015
Firstpage :
175
Lastpage :
179
Abstract :
With the development of social networks, people can easily share their feelings and express their opinions on interesting topics. Thus, social media mining is becoming an important research topic. However, huge volume of social Web data in various forms are rapidly generated around the world in a much faster speed than those of any other media. This can lead to the difficulties in social data analysis: noisy data and efficiency. In this paper, we propose a distributed data analytics platform for social media. First, data from various sources are collected and stored in a distributed index for efficient retrieval. Then, a distributed analytics framework is built from memory-based cluster computing based on the MapReduce paradigm. Finally, statistical analysis is performed and integrated for presentation. In the experiment, we built the platform by open source projects Hadoop and Spark, and implemented combinations of map and reduce operations. We compared the efficiency and scalability of the platform on various check-in datasets. Further evaluation is needed to verify the performance in different types of operations.
Keywords :
"Media","Big data","Sparks","Social network services","Distributed databases","Data analysis","Indexes"
Publisher :
ieee
Conference_Titel :
Wireless and Optical Communication Conference (WOCC), 2015 24th
ISSN :
2379-1268
Print_ISBN :
978-1-4799-8868-6
Electronic_ISBN :
2379-1276
Type :
conf
DOI :
10.1109/WOCC.2015.7346200
Filename :
7346200
Link To Document :
بازگشت