DocumentCode :
518712
Title :
Distribution of relations in parallel database based on PC clusters
Author :
Ping, Xuan
Author_Institution :
Sch. of Comput. Sci. & Technol., Heilongjiang Univ., Harbin, China
Volume :
3
fYear :
2010
fDate :
27-29 March 2010
Firstpage :
352
Lastpage :
356
Abstract :
In parallel database system, optimizing distribution of relations could improve processing efficiency of multi-join queries greatly. The cost of data communication is expensive in parallel system based on PC clusters. This paper proposes distribution of relations algorithm to select an appropriate data placement strategy for each relation, which includes selection of distribution attributes and nodes. The algorithm could make best use of intra-operator parallelism, independent inter-operator parallelism and pipelined parallelism of PC clusters system. At the same time, it could reduce additional communication cost of data redistribution. The result of experiment indicates the algorithm has good performance and contributes to promoting execution efficiency of parallel multi-join queries.
Keywords :
parallel processing; query processing; relational databases; PC clusters; data placement strategy; independent inter-operator parallelism; intra-operator parallelism; multijoin query processing; parallel database; pipelined parallelism; relation distribution; Bandwidth; Clustering algorithms; Computer science; Cost function; Data communication; Multidimensional systems; Parallel processing; Pipelines; Probability; Relational databases; PC clusters; data redistribution; multi-join query; pipelined parallelism;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computer Control (ICACC), 2010 2nd International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-1-4244-5845-5
Type :
conf
DOI :
10.1109/ICACC.2010.5486843
Filename :
5486843
Link To Document :
بازگشت