Title :
Effective skew handling for parallel sorting in multiprocessor database systems
Author :
Lo, Yu-lung ; Huang, Yu-Chen
Author_Institution :
Dept. of Inf. Manage., Chaoyang Univ. of Technol., Taichung, Taiwan
Abstract :
A consensus on a parallel architecture for very large database management has emerged. This architecture is based on a shared-nothing hardware organization. The computation model is very sensitive to skew in tuple distribution, however. The sorting operation is frequently used for database processing. For example sorting may be requested by users through the use of Distinct, Order By and Group By clauses in SQL. Although load balancing incurs processing costs, and therefore can have a profound influence on the optimized execution plan of a query, only few of the existing parallel sorting executions consider this factor. We present two parallel sorting algorithms using the dynamic load balancing technique to address the data skew problem. Our performance study indicates that the proposed parallel sorting techniques can provide very impressive performance improvement over conventional approaches.
Keywords :
SQL; distributed databases; multiprocessing systems; parallel algorithms; query processing; resource allocation; software performance evaluation; sorting; very large databases; SQL; load balancing; multiprocessor database systems; parallel algorithms; parallel architecture; parallel sorting; performance; query execution plan; shared-nothing hardware organization; skew handling; tuple distribution; very large database; Computational modeling; Computer architecture; Cost function; Database systems; Distributed computing; Hardware; Heuristic algorithms; Load management; Parallel architectures; Sorting;
Conference_Titel :
Parallel and Distributed Systems, 2002. Proceedings. Ninth International Conference on
Print_ISBN :
0-7695-1760-9
DOI :
10.1109/ICPADS.2002.1183392