DocumentCode :
140834
Title :
GLog: A high level graph analysis system using MapReduce
Author :
Jun Gao ; Jiashuai Zhou ; Chang Zhou ; Yu, Jeffrey Xu
Author_Institution :
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
544
Lastpage :
555
Abstract :
With the rapid growth of graphs in different applications, it is inevitable to leverage existing distributed data processing frameworks in managing large graphs. Although these frameworks ease the developing cost, it is still cumbersome and error-prone for developers to implement complex graph analysis tasks in distributed environments. Additionally, developers have to learn the details of these frameworks quite well, which is a key to improve the performance of distributed jobs. This paper introduces a high level query language called GLog and proposes its evaluation method to overcome these limitations. Specifically, we first design a RG (Relational-Graph) data model to mix relational data and graph data, and extend Datalog to GLog on RG tables to support various graph analysis tasks. Second, we define operations on RG tables, and show translation templates to convert a GLog query into a sequence of MapReduce jobs. Third, we propose two strategies, namely rule merging and iteration rewriting, to optimize the translated jobs. The final experiments show that GLog can not only express various graph analysis tasks in a more succinct way, but also achieve a better performance for most of the graph analysis tasks than Pig, another high level dataflow system.
Keywords :
DATALOG; data flow computing; graph theory; relational databases; DATALOG; GLog; MapReduce; Pig; complex graph analysis; distributed data processing; high level dataflow system; high level graph analysis system; high level query language; relational-graph data model; Algebra; Computer languages; Educational institutions; Engines; Indexes; Merging; Optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816680
Filename :
6816680
Link To Document :
بازگشت