Title :
Building a Data Warehouse for Twitter Stream Exploration
Author :
Rehman, Naveed Ur ; Mansmann, S. ; Weiler, Andreas ; Scholl, Marc H.
Author_Institution :
Univ. of Konstanz, Konstanz, Germany
Abstract :
In the recent year Twitter has evolved into an extremely popular social network and has revolutionized the ways of interacting and exchanging information on the Internet. By making its public stream available through a set of APIs Twitter has triggered a wave of research initiatives aimed at analysis and knowledge discovery from the data about its users and their messaging activities. While most of the projects and tools are tailored towards solving specific tasks, we pursue a goal of providing an application in dependent and universal analytical platform for supporting any kind of analysis and knowledge discovery. We employ the well established data warehousing technology with its underlying multidimensional data model, ETL routine for loading and consolidating data from different sources, OLAP functionality for exploring the data and data mining tools for more sophisticated analysis. In this work we describe the process of transforming the original stream into a set of related multidimensional cubes and demonstrate how the resulting data warehouse can be used for solving a variety of analytical tasks. We expect our proposed approach to be applicable for analyzing the data of other social networks as well.
Keywords :
Internet; application program interfaces; data mining; data models; data warehouses; social networking (online); API Twitter; ETL routine; Internet; OLAP functionality; Twitter stream exploration; data consolidation; data loading; data mining tools; data warehouse; knowledge discovery; messaging activities; multidimensional cubes; multidimensional data model; social network; Data mining; Data models; Data warehouses; Media; Twitter; Warehousing;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-2497-7
DOI :
10.1109/ASONAM.2012.230