DocumentCode :
659627
Title :
Managing massive graphs in relational DBMS
Author :
Ruiwen Chen
Author_Institution :
Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
1
Lastpage :
8
Abstract :
Massive graphs emerge in many real-world applications. Practitioners often find relational databases are inefficient in graph data management. In this paper, we investigate the efficiency issue by analyzing both I/O and CPU costs. First, we find the storage of a graph in relational DBMS violates the locality principle: graph queries will always reference neighbors; however, the data locations of neighbors are almost random. To solve this problem, we introduce partitioned graph storage as a new database design option. It combines database partitioning with available graph-partitioning algorithms to restructure the storage such that neighbors are located close to each other. Second, we find graph queries expressed with SQL introduce unnecessary overheads. To overcome the CPU costs, we propose a new storage access method, which we call graph scan, to retrieve neighbors in one single operation. We show experimentally that partitioned graph storage and graph scan can significantly reduce I/O and CPU costs. We conclude that a relational DBMS could be a good graph store, as long as the storage respects the locality principle and SQL overheads are eliminated.
Keywords :
SQL; cost reduction; graph theory; relational databases; storage management; CPU cost; SQL; Structured Query Language; cost analysis; cost reduction; data locations; database design option; database partitioning; graph data management; graph queries; graph scan; graph storage; graph-partitioning algorithms; input-output cost; locality principle; massive graphs management; partitioned graph storage; relational DBMS; relational database management systems; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Indexes; Partitioning algorithms; Social network services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691776
Filename :
6691776
Link To Document :
بازگشت