DocumentCode
659627
Title
Managing massive graphs in relational DBMS
Author
Ruiwen Chen
Author_Institution
Simon Fraser Univ., Burnaby, BC, Canada
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
1
Lastpage
8
Abstract
Massive graphs emerge in many real-world applications. Practitioners often find relational databases are inefficient in graph data management. In this paper, we investigate the efficiency issue by analyzing both I/O and CPU costs. First, we find the storage of a graph in relational DBMS violates the locality principle: graph queries will always reference neighbors; however, the data locations of neighbors are almost random. To solve this problem, we introduce partitioned graph storage as a new database design option. It combines database partitioning with available graph-partitioning algorithms to restructure the storage such that neighbors are located close to each other. Second, we find graph queries expressed with SQL introduce unnecessary overheads. To overcome the CPU costs, we propose a new storage access method, which we call graph scan, to retrieve neighbors in one single operation. We show experimentally that partitioned graph storage and graph scan can significantly reduce I/O and CPU costs. We conclude that a relational DBMS could be a good graph store, as long as the storage respects the locality principle and SQL overheads are eliminated.
Keywords
SQL; cost reduction; graph theory; relational databases; storage management; CPU cost; SQL; Structured Query Language; cost analysis; cost reduction; data locations; database design option; database partitioning; graph data management; graph queries; graph scan; graph storage; graph-partitioning algorithms; input-output cost; locality principle; massive graphs management; partitioned graph storage; relational DBMS; relational database management systems; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Indexes; Partitioning algorithms; Social network services;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691776
Filename
6691776
Link To Document