• DocumentCode
    1791597
  • Title

    Rainbow: A distributed and hierarchical RDF triple store with dynamic scalability

  • Author

    Rong Gu ; Wei Hu ; Yihua Huang

  • Author_Institution
    Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    561
  • Lastpage
    566
  • Abstract
    In the Big Data era, the ever-increasing RDF data have reached a scale in billions of triples and brought obstacles and challenges to single-node RDF data stores. As a result, many distributed RDF stores have been emerging in the Semantic Web community recently. However, currently published ones are either not enough efficient on performance or failed to achieve flexible scalability. In this paper, we propose Rainbow, a scalable and efficient RDF triple store. The RDF data indexing scheme in Rainbow is a hybrid one which is designed based on the statistical analysis of user query space. Further, to better support the hybrid indexing scheme, Rainbow adopts a distributed and hierarchical storage architecture that uses HBase as the scalable persistent storage and combines a distributed memory storage to speedup query performance. The RDF data in memory storage is partitioned by the consistent hashing algorithm to achieve the dynamic scalability. Experiments show that Rainbow outperforms typical existing distributed RDF triple stores, with excellent scalability and fault tolerance.
  • Keywords
    Big Data; database indexing; distributed processing; fault tolerant computing; query processing; semantic Web; statistical analysis; Big Data; HBase; RDF data indexing scheme; RDF data partitioning; Rainbow; consistent hashing algorithm; distributed hierarchical RDF triple store; distributed hierarchical storage architecture; distributed memory storage; dynamic scalability; fault tolerance; hybrid indexing scheme; query performance; scalable persistent storage; semantic Web community; single-node RDF data stores; statistical analysis; user query space; Distributed databases; Fault tolerance; Fault tolerant systems; Indexing; Pattern matching; Resource description framework; Scalability; RDF; SPARQL; big data; distributed computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004274
  • Filename
    7004274