• DocumentCode
    1100420
  • Title

    Dynamic load balancing in multicomputer database systems using partition tuning

  • Author

    Hua, Kien A. ; Lee, Chiang ; Hua, Chau M.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Central Florida, Orlando, FL, USA
  • Volume
    7
  • Issue
    6
  • fYear
    1995
  • fDate
    12/1/1995 12:00:00 AM
  • Firstpage
    968
  • Lastpage
    983
  • Abstract
    Shared nothing multiprocessor architecture is known to be more scalable to support very large databases. Compared to other join strategies, a hash-based join algorithm is particularly efficient and easily parallelized for this computation model. However, this hardware structure is very sensitive to the skew in tuple distribution. Unless the parallel hash join algorithm includes some dynamic load balancing mechanism, the skew effect can severely deteriorate the system performance. In this paper, we investigate this issue. In particular, three parallel hash join algorithms are presented. We implement a simulator to study the effectiveness of these schemes. The simulation model is validated by comparing the simulation results to those produced by the actual implementation of the algorithms running on a multiprocessor system. Our performance study indicates that a naive approach is not able to provide tangible savings. However, the carefully designed strategies can offer substantial improvement over conventional techniques for a wide range of skew conditions
  • Keywords
    database machines; database theory; distributed databases; parallel algorithms; query processing; relational databases; resource allocation; software performance evaluation; very large databases; computation model; database machine; dynamic load balancing; hash-based join algorithm; multicomputer database; multiprocessor system; parallel hash join algorithm; parallel hash join algorithms; partition tuning; performance study; relational database; scalable architecture; shared nothing multiprocessor architecture; simulation model; skew; system performance; tuple distribution; very large databases; Computational modeling; Computer architecture; Concurrent computing; Database systems; Hardware; Heuristic algorithms; Load management; Partitioning algorithms; Relational databases; System performance;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.476502
  • Filename
    476502