• DocumentCode
    1358202
  • Title
    Parallel execution of hash joins in parallel databases
  • Author
    Hsiao, Hui-I ; Chen, Ming-Syan ; Yu, Philip S.
  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    8
  • Issue
    8
  • fYear
    1997
  • fDate
    8/1/1997 12:00:00 AM
  • Firstpage
    872
  • Lastpage
    883
  • Abstract
    We explore two important issues, processor allocation and the use of hash filters, to improve the parallel execution of hash joins. To exploit the opportunity of pipelining for hash join execution, a scheme to transform a bushy execution tree to an allocation tree is first devised. In an allocation tree, each node denotes a pipeline. Then, using the concept of synchronous execution time, processors are allocated to the nodes in the allocation tree in such a way that inner relations in a pipeline can be made available at approximately the same time. Also, the approach of hash filtering is investigated to further improve the parallel execution of hash joins. Extensive performance studies are conducted via simulation to demonstrate the importance of processor allocation and to evaluate various schemes using hash filters. It is experimentally shown that processor allocation is, in general, the dominant factor of performance, and the effect of hash filtering becomes more prominent as the number of relations in a query increases.
  • Keywords
    database theory; distributed databases; parallel processing; query processing; relational algebra; relational databases; resource allocation; software performance evaluation; tree data structures; allocation tree; bushy execution tree; hash filters; hash join parallel execution; parallel databases; performance; pipelining; processor allocation; query processing; relational algebra; simulation; synchronous execution time; Costs; Database machines; Filtering; Filters; Industrial relations; Multiprocessing systems; Pipeline processing; Processor scheduling; Query processing; Relational databases;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1045-9219
  • Type
    jour
  • DOI
    10.1109/71.605772
  • Filename
    605772