• DocumentCode
    3678361
  • Title

    High-Performance, Distributed Dictionary Encoding of RDF Datasets

  • Author

    Alessandro Morari;Jesse Weaver;Oreste Villa;David Haglin;Antonino Tumeo;Vito Giovanni Castellana;John Feo

  • Author_Institution
    Pacific Northwest Nat. Lab., Richland, WA, USA
  • fYear
    2015
  • Firstpage
    250
  • Lastpage
    253
  • Abstract
    In this work we propose a novel approach for RDF (Resource Description Framework) dictionary encoding that employs a parallel RDF parser and a distributed dictionary data structure, exploiting RDF-specific optimizations. In contrast with previous solutions, this approach exploits the Partitioned Global Address Space (PGAS) programming model combined with active messages. We evaluate the performance of our dictionary encoder in our RDF database, GEMS (Graph Engine for Multithreaded Systems), and provide an empirical comparison against previous approaches. Our comparison shows that our dictionary encoder scales significantly better and achieves higher performance than the current state of the art, providing a key element for the realization of a more efficient RDF database.
  • Keywords
    "Resource description framework","Encoding","Dictionaries","Databases","Data structures","Throughput","Benchmark testing"
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2015.44
  • Filename
    7307591