• DocumentCode
    1426205
  • Title

    Performance analysis of parallel query processing algorithms for object-oriented databases

  • Author

    Su, Stanley Y W ; Ranka, Sanjay ; He, Xiang

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Florida Univ., Gainesville, FL, USA
  • Volume
    12
  • Issue
    6
  • fYear
    2000
  • Firstpage
    979
  • Lastpage
    996
  • Abstract
    Two types of parallel processing and optimization algorithms for processing object-oriented databases are the hybrid-hash pointer-based (HHP) algorithms and multi-wavefront (MWF) algorithms. We analyze these two algorithms and develop analytical formulas to capture their main performance features. We study their performance in three application environments, characterized by large databases having many object classes, each of which, respectively, (1) contains a large number of instances; (2) contains a relatively small number of instances; and (3) is of varying size. A horizontal data partitioning strategy is used in (1). A class-per-node assignment strategy is used in (2). In (3), object classes are partitioned horizontally and assigned to a varying number of processors depending on their different sizes. The MWF algorithm has three distinguishing features which contribute to its better performance: (a) a two-phase processing strategy, (b) vertical partitioning of horizontal segments, and (c) dynamic determination of the collision point in MWF propagations, which results in an optimized query execution plan. If these features are adopted by an HHP algorithm, its performance is comparable with that of the MWF algorithm because the difference in CPU time between them is negligible. The computing environment is a network of workstations having a shared-nothing architecture. The schema and some queries selected from the OO7 benchmark are used in the performance analyses and comparisons. The queries are modified slightly in different data environments in order to reflect the features of diverse database applications.
  • Keywords
    file organisation; object-oriented databases; parallel algorithms; parallel databases; query processing; software performance evaluation; workstation clusters; CPU time; OO7 database benchmark; application environments; class-per-node assignment strategy; data environments; database applications; dynamic collision point determination; horizontal data partitioning strategy; horizontal segments; hybrid-hash pointer-based algorithms; instance numbers; large databases; multi-wavefront algorithms; object classes; object-oriented databases; optimization algorithms; optimized query execution plan; parallel query processing algorithms; performance analysis; processor numbers; shared-nothing architecture; two-phase processing strategy; vertical partitioning; workstation network; Algorithm design and analysis; Computer architecture; Computer networks; Object oriented databases; Parallel processing; Partitioning algorithms; Performance analysis; Query processing; Spatial databases; Workstations
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.895805
  • Filename
    895805