• DocumentCode
    659627
  • Title

    Managing massive graphs in relational DBMS

  • Author

    Ruiwen Chen

  • Author_Institution
    Simon Fraser Univ., Burnaby, BC, Canada
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Massive graphs emerge in many real-world applications. Practitioners often find relational databases are inefficient in graph data management. In this paper, we investigate the efficiency issue by analyzing both I/O and CPU costs. First, we find the storage of a graph in relational DBMS violates the locality principle: graph queries will always reference neighbors; however, the data locations of neighbors are almost random. To solve this problem, we introduce partitioned graph storage as a new database design option. It combines database partitioning with available graph-partitioning algorithms to restructure the storage such that neighbors are located close to each other. Second, we find graph queries expressed with SQL introduce unnecessary overheads. To overcome the CPU costs, we propose a new storage access method, which we call graph scan, to retrieve neighbors in one single operation. We show experimentally that partitioned graph storage and graph scan can significantly reduce I/O and CPU costs. We conclude that a relational DBMS could be a good graph store, as long as the storage respects the locality principle and SQL overheads are eliminated.
  • Keywords
    SQL; cost reduction; graph theory; relational databases; storage management; CPU cost; SQL; Structured Query Language; cost analysis; cost reduction; data locations; database design option; database partitioning; graph data management; graph queries; graph scan; graph storage; graph-partitioning algorithms; input-output cost; locality principle; massive graphs management; partitioned graph storage; relational DBMS; relational database management systems; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Indexes; Partitioning algorithms; Social network services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691776
  • Filename
    6691776