مرکز منطقه ای اطلاع رساني علوم و فناوري - Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce

DocumentCode :

1665004

Title :

Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce

Author :

Mammo, Mulugeta ; Bansal, Srividya K.

Author_Institution :

Sch. of Comput., Inf., & Decision Syst. Eng. (SCIDSE), Arizona State Univ. Mesa, Mesa, AZ, USA

fYear :

2015

Firstpage :

Lastpage :

Abstract :

The processing of large volumes of RDF data require an efficient storage and query processing engine that can scale well with the volume of data. The initial attempts to address this issue focused on optimizing native RDF stores as well as conventional relational databases management systems. But as the volume of RDF data grew to exponential proportions, the limitations of these systems became apparent and researchers began to focus on using big data analysis tools, most notably Hadoop, to process RDF data. This paper presents a comparative analysis of performance of Presto (distributed SQL query engine) in processing big RDF data against Apache Hive. To evaluate the performance Presto for big RDF data processing, a map-reduce program and a compiler, based on Flex and Bison, were implemented. The map-reduce program loads RDF data into HDFS while the compiler translates SPARQL queries into a subset of SQL that Presto (and Hive) can understand.

Keywords :

Big Data; SQL; data analysis; query processing; Apache Hive; Bison; Flex; HDFS; Hadoop; MapReduce; Presto; big RDF data processing; big data analysis tools; distributed SPARQL; distributed SQL query engine; query processing engine; relational database management systems; resource description framework; Distributed databases; Engines; Facebook; Query processing; Resource description framework; Big Data processing; Database Performance; Evaluation; Querying; Semantic Web data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Big Data (BigData Congress), 2015 IEEE International Congress on

Conference_Location :

New York, NY

Print_ISBN :

978-1-4673-7277-0

Type :

conf

DOI :

10.1109/BigDataCongress.2015.15

Filename :

7207199

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1665004