Regional Information Center for Science and Technology

DocumentCode :

3765918

Title :

OLAP query performance tuning in Spark

Author :

Yanfei Lv; Huihong He; Yasong Zheng; Zhe Liu; Hong Zhang

Author_Institution :

National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China

Year :

2015

Firstpage :

Lastpage :

Abstract :

OLAP queries are an efficient way to gain quick insight into big data. Spark is a fast, general-purpose engine for big data processing that supports interactive OLAP queries. However, because Spark is a general-purpose engine, many parameters affect its performance, so it is necessary to study the appropriate settings to achieve better performance in a specific scenario. In this paper, we choose typical queries from real scenarios and present measurement results obtained by running these queries on real datasets of up to 7 TB on a 32-node cluster. After tuning, the queries show a clear performance improvement over the default settings.

Publisher :

IET

Conference_Title :

Cyberspace Technology (CCT 2015), Third International Conference on

Print_ISBN :

978-1-78561-089-9

Type :

conf

DOI :

10.1049/cp.2015.0832

Filename :

7446924

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3765918