DocumentCode
685911
Title
A Method of Data Distribution for Distributed Cross Join
Author
Ping Lu ; Shengmei Luo ; Zhiping Wang ; Wenwu Qu
fYear
2013
fDate
13-15 Dec. 2013
Firstpage
105
Lastpage
109
Abstract
One of the major challenges in big data processing is the efficiency of cross join, which arises in tasks such as similarity calculation in business intelligence. In this paper we introduce an optimal data distribution algorithm for distributed cross join, an operation that combines each row from the first table with each row from the second table. The algorithm can reduce network traffic and guarantee the computation balance of the distributed system.
Keywords
Big Data; distributed algorithms; Big Data processing; business intelligence; computation balance; distributed cross-join efficiency; distributed system; network traffic reduction; optimal data distribution algorithm; similarity calculation; Algorithm design and analysis; Clustering algorithms; Computational modeling; Distributed databases; Niobium; Optimization; cross join; data distribution
fLanguage
English
Publisher
IEEE
Conference_Titel
Advanced Cloud and Big Data (CBD), 2013 International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4799-3260-3
Type
conf
DOI
10.1109/CBD.2013.5
Filename
6824581