• DocumentCode
    3140006
  • Title

    Distributed Semantic Web Data Management in HBase and MySQL Cluster

  • Author

    Franke, Craig ; Morin, Samuel ; Chebotko, Artem ; Abraham, John ; Brazier, Pearl

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas - Pan American, Edinburg, TX, USA
  • fYear
    2011
  • fDate
    4-9 July 2011
  • Firstpage
    105
  • Lastpage
    112
  • Abstract
    Various computing and data resources on the Web are being enhanced with machine-interpretable semantic descriptions to facilitate better search, discovery and integration. This interconnected metadata constitutes the Semantic Web, whose volume can potentially grow the scale of the Web. Efficient management of Semantic Web data, expressed using the W3C´s Resource Description Framework (RDF), is crucial for supporting new data-intensive, semantics-enabled applications. In this work, we study and compare two approaches to distributed RDF data management based on emerging cloud computing technologies and traditional relational database clustering technologies. In particular, we design distributed RDF data storage and querying schemes for HBase and MySQL Cluster and conduct an empirical comparison of these approaches on a cluster of commodity machines using datasets and queries from the Third Provenance Challenge and Lehigh University Benchmark. Our study reveals interesting patterns in query evaluation, shows that our algorithms are promising, and suggests that cloud computing has a great potential for scalable Semantic Web data management.
  • Keywords
    SQL; cloud computing; distributed processing; meta data; pattern clustering; relational databases; semantic Web; HBase; Lehigh University Benchmark; MySQL cluster; RDF; Semantic Web; W3C resource description framework; cloud computing; data resources; distributed semantic web data management; machine interpretable semantic descriptions; metadata interconnection; relational database clustering technologies; Cloud computing; Clustering algorithms; Distributed databases; Pattern matching; Resource description framework; HBase; MySQL Cluster; RDF; SPARQL; SQL; Semantic Web; cloud computing; distributed database; performance; query; scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2011 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    2159-6182
  • Print_ISBN
    978-1-4577-0836-7
  • Electronic_ISBN
    2159-6182
  • Type

    conf

  • DOI
    10.1109/CLOUD.2011.19
  • Filename
    6008699