• DocumentCode
    3140184
  • Title

    Scalable Complex Query Processing over Large Semantic Web Data Using Cloud

  • Author

    Husain, Mohammad Farhan ; McGlothlin, James ; Khan, Latifur ; Thuraisingham, Bhavani

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2011
  • fDate
    4-9 July 2011
  • Firstpage
    187
  • Lastpage
    194
  • Abstract
    Cloud computing solutions continue to grow increasingly popular both in research and in the commercial IT industry. With this popularity comes ever increasing challenges for the cloud computing service providers. Semantic web is another domain of rapid growth in both research and industry. RDF datasets are becoming increasingly large and complex and existing solutions do not scale adequately. In this paper, we will detail a scalable semantic web framework built using cloud computing technologies. We define solutions for generating and executing optimal query plans. We handle not only queries with Basic Graph Patterns (BGP) but also complex queries with optional blocks. We have devised a novel algorithm to handle these complex queries. Our algorithm minimizes binding triple patterns and joins between them by identifying common blocks by algorithms to find sub graph isomorphism and building a query plan utilizing that information. We utilize Hadoop´s MapReduce framework to process our query plan. We will show that our framework is extremely scalable and efficiently answers complex queries.
  • Keywords
    cloud computing; electronic data interchange; graph theory; meta data; query processing; semantic Web; MapReduce framework; RDF datasets; basic graph patterns; cloud computing solutions; commercial IT industry; large semantic Web data; scalable complex query processing; Cloud computing; Heuristic algorithms; Ontologies; Query processing; Resource description framework; US Department of Energy; Cloud; Hadoop; RDF; Semantic Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2011 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    2159-6182
  • Print_ISBN
    978-1-4577-0836-7
  • Electronic_ISBN
    2159-6182
  • Type

    conf

  • DOI
    10.1109/CLOUD.2011.15
  • Filename
    6008709