DocumentCode
518712
Title
Distribution of relations in parallel database based on PC clusters
Author
Ping, Xuan
Author_Institution
Sch. of Comput. Sci. & Technol., Heilongjiang Univ., Harbin, China
Volume
3
fYear
2010
fDate
27-29 March 2010
Firstpage
352
Lastpage
356
Abstract
In parallel database system, optimizing distribution of relations could improve processing efficiency of multi-join queries greatly. The cost of data communication is expensive in parallel system based on PC clusters. This paper proposes distribution of relations algorithm to select an appropriate data placement strategy for each relation, which includes selection of distribution attributes and nodes. The algorithm could make best use of intra-operator parallelism, independent inter-operator parallelism and pipelined parallelism of PC clusters system. At the same time, it could reduce additional communication cost of data redistribution. The result of experiment indicates the algorithm has good performance and contributes to promoting execution efficiency of parallel multi-join queries.
Keywords
parallel processing; query processing; relational databases; PC clusters; data placement strategy; independent inter-operator parallelism; intra-operator parallelism; multijoin query processing; parallel database; pipelined parallelism; relation distribution; Bandwidth; Clustering algorithms; Computer science; Cost function; Data communication; Multidimensional systems; Parallel processing; Pipelines; Probability; Relational databases; PC clusters; data redistribution; multi-join query; pipelined parallelism;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computer Control (ICACC), 2010 2nd International Conference on
Conference_Location
Shenyang
Print_ISBN
978-1-4244-5845-5
Type
conf
DOI
10.1109/ICACC.2010.5486843
Filename
5486843
Link To Document