• DocumentCode
    3704792
  • Title

    Towards an efficient platform for social big data analytics

  • Author

    Jenq-Haur Wang;Kuan-Ting Chen

  • Author_Institution
    Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei, Taiwan
  • fYear
    2015
  • Firstpage
    175
  • Lastpage
    179
  • Abstract
    With the development of social networks, people can easily share their feelings and express their opinions on interesting topics. Thus, social media mining is becoming an important research topic. However, huge volume of social Web data in various forms are rapidly generated around the world in a much faster speed than those of any other media. This can lead to the difficulties in social data analysis: noisy data and efficiency. In this paper, we propose a distributed data analytics platform for social media. First, data from various sources are collected and stored in a distributed index for efficient retrieval. Then, a distributed analytics framework is built from memory-based cluster computing based on the MapReduce paradigm. Finally, statistical analysis is performed and integrated for presentation. In the experiment, we built the platform by open source projects Hadoop and Spark, and implemented combinations of map and reduce operations. We compared the efficiency and scalability of the platform on various check-in datasets. Further evaluation is needed to verify the performance in different types of operations.
  • Keywords
    "Media","Big data","Sparks","Social network services","Distributed databases","Data analysis","Indexes"
  • Publisher
    ieee
  • Conference_Titel
    Wireless and Optical Communication Conference (WOCC), 2015 24th
  • ISSN
    2379-1268
  • Print_ISBN
    978-1-4799-8868-6
  • Electronic_ISBN
    2379-1276
  • Type

    conf

  • DOI
    10.1109/WOCC.2015.7346200
  • Filename
    7346200