• DocumentCode
    610387
  • Title
    Efficient snapshot retrieval over historical graph data
  • Author
    Khurana, U.; Deshpande, A.
  • Author_Institution
    Univ. of Maryland, College Park, MD, USA
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    997
  • Lastpage
    1008
  • Abstract
    We present a distributed graph database system for managing historical data for large evolving information networks, with the goal of enabling temporal and evolutionary queries and analysis. The cornerstone of our system is DeltaGraph, a novel, user-extensible, highly tunable, distributed hierarchical index structure that compactly records historical network information and supports efficient retrieval of historical graph snapshots for single-site or parallel processing. Our system exposes a general programmatic API to process and analyze the retrieved snapshots. Along with the original graph data, DeltaGraph can also maintain and index auxiliary information; this functionality can be used to extend the structure to efficiently execute queries such as subgraph pattern matching over historical data. We develop analytical models of both the storage space needed and the snapshot retrieval times to aid in choosing the right construction parameters for a specific scenario. We also present an in-memory graph data structure called GraphPool that can maintain hundreds of historical graph instances in main memory in a non-redundant manner. We present a comprehensive experimental evaluation that illustrates the effectiveness of our proposed techniques at managing historical graph information.
  • Keywords
    application program interfaces; data structures; distributed databases; information networks; query processing; DeltaGraph; GraphPool; analytical models; comprehensive experimental evaluation; distributed graph database system; distributed hierarchical index structure; evolutionary analysis; evolutionary queries; general programmatic API; historical data management; historical graph data; historical graph information management; historical graph snapshots; historical network information; in-memory graph data structure; information networks; parallel processing; single-site processing; snapshot retrieval; storage space; subgraph pattern matching; temporal queries; Analytical models; Data models; Data structures; Indexes; Memory management; Pattern matching;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type
    conf
  • DOI
    10.1109/ICDE.2013.6544892
  • Filename
    6544892