• DocumentCode
    2869434
  • Title

    Adaptive parallel hash join in main-memory databases

  • Author

    Keller, Arthur M. ; Roy, Shaibal

  • Author_Institution
    Stanford Univ., CA, USA
  • fYear
    1991
  • fDate
    4-6 Dec 1991
  • Firstpage
    58
  • Lastpage
    67
  • Abstract
    Presents an algorithm for parallel hash-join computation on main-memory databases that adapts to data skew, and describes its implementation on the IBM RP3 multiprocessor. The algorithm exploits the random-access capabilities of main-memory databases to detect and counteract skew on the fly. Data skew is detected at run time by monitoring the observed frequencies of values of the join attribute and applying to them a threshold function that takes account of the distribution of workload among processors. If and when this threshold is reached for certain values of the join attribute, the computation corresponding to those values is fragmented among an appropriate number of processors. Fragmentation requires some replication of input tuples, which modestly increases the total workload, but it significantly reduces the completion time by reducing the workload at the overloaded processor. A simplified analysis is supplemented by experiments. The description and analysis of the algorithm are based on the shared-nothing model. The implementation uses hierarchical shared memory providing non-uniform memory access.
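The skew-handling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the threshold (a multiple of the average per-processor share), the round-robin fragmentation, and the names `skewed_keys`, `partition`, and `local_join` are all assumptions made for illustration. The sketch also finds skew in a single pre-pass over the probe input, whereas the paper describes detection on the fly.

```python
from collections import Counter, defaultdict
from math import ceil


def skewed_keys(probe, num_procs, factor=1.0):
    """Map each over-threshold join value to a fragment count.

    Assumed threshold: factor * (probe tuples / processors). The paper's
    actual threshold function is not given in the abstract.
    """
    counts = Counter(key for key, _ in probe)
    fair_share = len(probe) / num_procs
    return {
        key: min(num_procs, ceil(n / fair_share))
        for key, n in counts.items()
        if n > factor * fair_share
    }


def partition(build, probe, num_procs, fragments):
    """Hash-partition both inputs, fragmenting the skewed join values."""
    build_parts = [[] for _ in range(num_procs)]
    probe_parts = [[] for _ in range(num_procs)]
    seen = Counter()
    for key, payload in probe:
        k = fragments.get(key, 1)
        # Spread the probe tuples of a skewed value over k processors.
        target = (hash(key) + seen[key] % k) % num_procs
        seen[key] += 1
        probe_parts[target].append((key, payload))
    for key, payload in build:
        # Replicate matching build tuples to every fragment's processor.
        for offset in range(fragments.get(key, 1)):
            build_parts[(hash(key) + offset) % num_procs].append((key, payload))
    return build_parts, probe_parts


def local_join(build_part, probe_part):
    """Ordinary in-memory hash join executed by one processor."""
    table = defaultdict(list)
    for key, payload in build_part:
        table[key].append(payload)
    return [(key, b, p) for key, p in probe_part for b in table.get(key, [])]
```

In this sketch each simulated processor would call `local_join` on its own pair of partitions. Replicating build tuples adds a little total work, and in exchange the probe tuples of a hot value no longer all land on one processor.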
  • Keywords
    database theory; file organisation; parallel algorithms; IBM RP3 multiprocessor; adaptive parallel hash join; data skew; hierarchical shared memory; input tuples; join attribute; main-memory databases; parallel algorithm; parallel hash-join computation; random access; shared-nothing model; threshold function; Algorithm design and analysis; Computer science; Concurrent computing; Databases; Degradation; Frequency; Load management; Monitoring; Processor scheduling; Scheduling algorithm;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Information Systems, 1991., Proceedings of the First International Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    0-8186-2295-4
  • Type

    conf

  • DOI
    10.1109/PDIS.1991.183068
  • Filename
    183068