• DocumentCode
    624124
  • Title

    Experimental framework for searching large RDF on GPUs based on key-value storage

  • Author

    Choksuchat, Chidchanok ; Chantrapornchai, Chantana

  • Author_Institution
    Dept. of Comput., Silpakorn Univ., Nakhon Pathom, Thailand
  • fYear
    2013
  • fDate
    29-31 May 2013
  • Firstpage
    171
  • Lastpage
    176
  • Abstract
    The Resource Description Framework (RDF) is commonly used for semantic web queries. Over the past decade, driven by big data processing, large numbers of RDF triples have been crawled. The triples are usually stored in a distributed fashion on cloud storage or large clusters. Answering a query therefore usually requires a search across platforms, which is difficult to handle and takes a long time to execute. Thus, the data representation and the platform are important for speeding up the search and handling heterogeneity. In this paper, we present an experimental framework for searching RDF data on GPU clusters. Our framework uses the Java platform to manipulate the semantic query and JCuda to perform the GPU processing. Apache Cassandra storage, in the form of CumulusRDF, is used to store the key-value pairs for searching. In the experiments, the DBpedia and Freebase datasets are extracted and manipulated. The triple structures are transformed and loaded into Apache Cassandra storage in CumulusRDF's flat layout. The subject-predicate-object keys are kept in the CQL cache. About 300 million tags can be handled within one machine, which can reduce time at low cost. We reshape the data grid from Java's row-major ordering to the CUDA GPU thread grid, and the retrieved keys are joined to find the correspondences in the RDF graph.
  • Keywords
    Java; data structures; graphics processing units; query processing; semantic Web; Apache Cassandra storage; CQL caching; CUDA; CumulusRDF; GPU processing; Java platform; RDF; clouds storage; data representation; key-value storage; query answer; resource description framework; semantic Web data; semantic query; subject-predicate-object keys; Arrays; Graphics processing units; Indexes; Instruction sets; Java; Kernel; Resource description framework; GPU; RDF; concurrent Java; memory management; parallel processing; parallel string matching;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Software Engineering (JCSSE), 2013 10th International Joint Conference on
  • Conference_Location
    Maha Sarakham
  • Print_ISBN
    978-1-4799-0805-9
  • Type

    conf

  • DOI
    10.1109/JCSSE.2013.6567340
  • Filename
    6567340
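
The abstract's step of reshaping Java's row-major data into a CUDA thread grid can be illustrated with a minimal sketch. This is not the authors' implementation: the class name RowMajorGridSketch, the use of integer IDs for subject-predicate-object keys, and the block size are all assumptions made for illustration. The nested loops only simulate CUDA's block and thread indexing in plain Java; no JCuda or GPU call is made.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: flatten (subject, predicate, object) ID triples in
// row-major order and visit them the way a 1-D CUDA grid would index them.
public final class RowMajorGridSketch {
    // Each triple occupies three consecutive slots in the flat array.
    public static final int FIELDS = 3;

    // Row-major flattening: element (row, col) lands at row * FIELDS + col.
    public static int[] flatten(int[][] triples) {
        int[] flat = new int[triples.length * FIELDS];
        for (int row = 0; row < triples.length; row++) {
            for (int col = 0; col < FIELDS; col++) {
                flat[row * FIELDS + col] = triples[row][col];
            }
        }
        return flat;
    }

    // Number of blocks needed so that every item gets one thread (ceiling division).
    public static int blocksNeeded(int items, int blockDim) {
        return (items + blockDim - 1) / blockDim;
    }

    // CPU simulation of a one-thread-per-triple kernel: each simulated thread
    // computes its global index and checks the predicate slot of its triple.
    public static List<Integer> matchPredicate(int[] flat, int predicateId, int blockDim) {
        int rows = flat.length / FIELDS;
        int gridDim = blocksNeeded(rows, blockDim);
        List<Integer> hits = new ArrayList<>();
        for (int blockIdx = 0; blockIdx < gridDim; blockIdx++) {
            for (int threadIdx = 0; threadIdx < blockDim; threadIdx++) {
                int row = blockIdx * blockDim + threadIdx; // global thread index
                if (row >= rows) {
                    continue; // bounds guard for the partially filled last block
                }
                if (flat[row * FIELDS + 1] == predicateId) {
                    hits.add(row);
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Made-up IDs standing in for subject, predicate and object keys.
        int[][] triples = {
            {1, 10, 100}, {2, 20, 200}, {3, 10, 300}, {4, 30, 400}, {5, 10, 500}
        };
        int[] flat = flatten(triples);
        System.out.println(matchPredicate(flat, 10, 2)); // prints [0, 2, 4]
    }
}
```

In a real GPU version the flat array would be copied to device memory and the two loops would be replaced by a kernel launch, with the same index arithmetic performed per thread.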