DocumentCode :
685911
Title :
A Method of Data Distribution for Distributed Cross Join
Author :
Ping Lu ; Shengmei Luo ; Zhiping Wang ; Wenwu Qu
fYear :
2013
fDate :
13-15 Dec. 2013
Firstpage :
105
Lastpage :
109
Abstract :
One of the major challenges in big data processing is the efficiency of cross join, such as the similarity calculation in business intelligence. In this paper we introduce an optimal data distribution algorithm for distributed cross join which combine each row from the first table with each row from the second table, which can reduce the network traffic and guarantee the computation balance of the distributed system.
Keywords :
Big Data; distributed algorithms; Big Data processing; business intelligence; computation balance; distributed cross-join efficiency; distributed system; network traffic reduction; optimal data distribution algorithm; similarity calculation; Algorithm design and analysis; Clustering algorithms; Computational modeling; Distributed databases; Niobium; Optimization; cross join; data distribution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Cloud and Big Data (CBD), 2013 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4799-3260-3
Type :
conf
DOI :
10.1109/CBD.2013.5
Filename :
6824581
Link To Document :
بازگشت