DocumentCode :
3765918
Title :
OLAP query performance tuning in Spark
Author :
Yanfei Lv; Huihong He; Yasong Zheng; Zhe Liu; Hong Zhang
Author_Institution :
National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
OLAP query is an efficient way to gain quick insight into big data. Spark is a fast and general engine for big data processing, which supports interactive OLAP queries. Nevertheless, as a general engine, there are many parameters that affect the performance of the Spark, and thus it is necessary to study the appropriate setting in order to gain better performance on a specific scenario. In this paper, we choose typical queries from real scenarios, and present measurement results that are obtained by perform these queries on real dataset up to 7T13 on a 32 nodes cluster. After tuning, queries gain obvious performance improvement against the default setting.
Publisher :
iet
Conference_Titel :
Cyberspace Technology (CCT 2015), Third International Conference on
Print_ISBN :
978-1-78561-089-9
Type :
conf
DOI :
10.1049/cp.2015.0832
Filename :
7446924
Link To Document :
بازگشت