DocumentCode :
2543485
Title :
A cost aware adaptive multiple table join evaluation in MapReduce
Author :
Wang, Linqing ; Gao, Jun ; Wang, Tengjiao ; Li, Hongyan
Author_Institution :
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
fYear :
2012
fDate :
29-31 May 2012
Firstpage :
2437
Lastpage :
2441
Abstract :
Nowadays, MapReduce has become an effective tool for large scale data analysis. It is naturally designed for group-by aggregation tasks rather than join operator which is common in real analysis works. The existing join methods in MapReduce may earn different performances in different cases, which makes how to choose a good join plan from a join list difficult. The current static optimization can´t generate an efficient evaluation plan for a given join list. In this paper, we will introduce some custom join technologies and then propose an adaptive join plan generator for multiple join depending on both rule-based model and cost-based model considering the intermediate data.
Keywords :
costing; data analysis; distributed programming; knowledge based systems; optimisation; task analysis; MapReduce; adaptive join plan generator; cost-based model; custom join technology; data analysis; rule-based model; static optimization; task aggregation; Adaptation models; Computational modeling; Data models; Data processing; File systems; Generators; Optimization; Cost-based Model; Hive; MapReduce; Optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
Type :
conf
DOI :
10.1109/FSKD.2012.6233855
Filename :
6233855
Link To Document :
بازگشت