DocumentCode :
659628
Title :
A distributed approach for graph-oriented multidimensional analysis
Author :
Denis, Benoit ; Ghrab, Amine ; Skhiri, Sabri
Author_Institution :
Univ. Catholique de Louvain, Louvain-la-Neuve, Belgium
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
9
Lastpage :
16
Abstract :
The importance of graphs as the fundamental structure underpinning many real world applications is no longer to be proved. Large graphs have emerged in various fields such as biological, social and transportation networks. The sheer volume of these networks poses challenges to traditional techniques for storage and analysis of graph data. In particular, OLAP analysis requires access to large portions of data to extract key information and to feed strategic decision making. OLAP provides multilevel, multiperspective views of the data. Most of the current techniques are optimized for centralized graph processing. A distributed approach providing horizontal scalability is required in order to handle the analysis workload. In this paper, we focus on applying OLAP analysis on large, distributed graph data. We describe Distributed Graph Cube, our distributed framework for graph-based OLAP cubes computation and aggregation. Experimental results on large, real-world datasets demonstrate that our method significantly outperforms its centralized counterparts. We also evaluate the performance of both Hadoop and Spark for distributed cubes computations.
Keywords :
data analysis; data mining; decision making; distributed processing; graph theory; storage management; Hadoop; OLAP analysis; Spark; analysis workload; centralized graph processing; distributed approach; distributed cubes computations; distributed graph cube; graph data analysis; graph data storage; graph-based OLAP cubes aggregation; graph-based OLAP cubes computation; graph-oriented multidimensional analysis; horizontal scalability; information extraction; online analytical processing; strategic decision making; Aggregates; Airports; Information management; Random access memory; Scalability; Sparks; Time complexity; Distributed Graph Processing; Large Multidimensional Networks; OLAP Cubes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691777
Filename :
6691777
Link To Document :
بازگشت