Title :
Permutation-based range-join algorithms on N-dimensional meshes
Author :
Chen, Shao Dong ; Shen, Hong ; Topor, Rodney
Author_Institution :
Hutchison Telecommun. Ltd., Kowloon, China
fDate :
4/1/2002 12:00:00 AM
Abstract :
We present four efficient parallel algorithms for computing a nonequijoin, called range-join, of two relations on N-dimensional mesh-connected computers. Range-joins of relations R and S are an important generalization of conventional equijoins and band-joins and are solved by permutation-based approaches in all proposed algorithms. In general, after sorting all subsets of both relations, the proposed algorithms permute every sorted subset of relation S to each processor in turn, where it is joined with the local subset of relation R. To permute the subsets of S efficiently, we propose two data permutation approaches, namely, the shifting approach which permutes the data recursively from lower dimensions to higher dimensions and the Hamiltonian-cycle approach which first constructs a Hamiltonian cycle on the mesh and then permutes the data along this cycle by repeatedly transferring data from each processor to its successor. We apply the shifting approach to meshes with different storage capacities which results in two different join algorithms. The basic shifting join (BASHJ) algorithm can minimize the number of subsets stored temporarily at a processor, but requires a large number of data transmissions, while the buffering shifting join (BUSHJ) algorithm can achieve a high parallelism and minimize the number of data transmissions, but requires a large number of subsets stored at each processor
Keywords :
database theory; parallel algorithms; parallel databases; relational databases; sorting; Hamiltonian-cycle approach; N-dimensional mesh-connected computers; band-joins; basic shifting join algorithm; buffering shifting join algorithm; data permutation approaches; nonequijoin; parallel Hamiltonian-cycle join algorithm; parallel algorithms; permutation-based range-join algorithms; recursive Hamiltonian-cycle join algorithm; shifting approach; sorting; storage capacities; Buffer storage; Concurrent computing; Data communication; Parallel algorithms; Parallel processing; Sorting;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on