DocumentCode :
1662781
Title :
TripleCloud: An Infrastructure for Exploratory Querying over Web-Scale RDF Data
Author :
Guéret, Christophe ; Kotoulas, Spyros ; Groth, Paul
Author_Institution :
VU Univ. Amsterdam, Amsterdam, Netherlands
Volume :
3
fYear :
2011
Firstpage :
245
Lastpage :
248
Abstract :
As the availability of large-scale RDF data sets has grown, there has been a corresponding growth in researchers' and practitioners' interest in analyzing and investigating these data sets. However, given their size and messiness, there is significant overhead in setting up the infrastructure to store and query them. In this paper, we present TripleCloud, a system that aims to lower the entry cost of exploring Web-scale RDF data sets. The system takes advantage of existing cloud-based key-value stores (e.g., BigTable, HBase) both to enable scalability and to hide the complexities of infrastructure deployment and maintenance. It layers over these key-value stores a robust query engine able to return approximate answers. We test the scalability of the approach, scaling to over 3 billion triples for complex queries. In addition to an implementation over HBase, TripleCloud runs over the Google App Engine, allowing us to perform a cost evaluation of the approach.
Keywords :
Internet; query processing; search engines; Google App Engine; TripleCloud; Web-scale RDF data; cloud based key-value store; complex query; exploratory query; robust query engine; Distributed databases; Engines; Google; Resource description framework; Scalability; Servers; Cloud Computing; Key-value stores; RDF; SPARQL;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
Type :
conf
DOI :
10.1109/WI-IAT.2011.166
Filename :
6040851