Title :
A cost aware adaptive multiple table join evaluation in MapReduce
Author :
Wang, Linqing ; Gao, Jun ; Wang, Tengjiao ; Li, Hongyan
Author_Institution :
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
Abstract :
Nowadays, MapReduce has become an effective tool for large scale data analysis. It is naturally designed for group-by aggregation tasks rather than join operator which is common in real analysis works. The existing join methods in MapReduce may earn different performances in different cases, which makes how to choose a good join plan from a join list difficult. The current static optimization can´t generate an efficient evaluation plan for a given join list. In this paper, we will introduce some custom join technologies and then propose an adaptive join plan generator for multiple join depending on both rule-based model and cost-based model considering the intermediate data.
Keywords :
costing; data analysis; distributed programming; knowledge based systems; optimisation; task analysis; MapReduce; adaptive join plan generator; cost-based model; custom join technology; data analysis; rule-based model; static optimization; task aggregation; Adaptation models; Computational modeling; Data models; Data processing; File systems; Generators; Optimization; Cost-based Model; Hive; MapReduce; Optimization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6233855