• DocumentCode
    1361192
  • Title

    The design and implementation of seeded trees: an efficient method for spatial joins

  • Author

    Lo, Ming-Ling ; Ravishankar, Chinya V.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    10
  • Issue
    1
  • fYear
    1998
  • Firstpage
    136
  • Lastpage
    152
  • Abstract
    Existing methods for spatial joins require pre-existing spatial indices or other precomputation, but such approaches are inefficient and limited in generality. Operand data sets of spatial joins may not all have precomputed indices, particularly when they are dynamically generated by other selection or join operations. Also, existing spatial indices are mostly designed for spatial selections, and are not always efficient for joins. This paper explores the design and implementation of seeded trees, which are effective for spatial joins and efficient to construct at join time. Seeded trees are R-tree-like structures, but divided into seed levels and grown levels. This structure facilitates using information regarding the join to accelerate the join process, and allows efficient buffer management. In addition to the basic structure and behavior of seeded trees we present techniques for efficient seeded tree construction, a new buffer management strategy to lower I/O costs, and theoretical analysis for choosing algorithmic parameters. We also present methods for reducing space requirements and improving the stability of seeded tree performance with no additional I/O costs. Our performance studies show that the seeded tree method outperforms other tree-based methods by far both in terms of the number disk pages accessed and weighted I/O costs. Further, its performance gain is stable across different input data, and its incurred CPU penalties are also lower
  • Keywords
    database theory; spatial data structures; tree data structures; visual databases; CPU penalties; R-tree-like; buffer management; grown levels; performance gain; seed levels; seeded trees; space requirements; spatial indices; spatial joins; Acceleration; Algorithm design and analysis; Costs; Geographic Information Systems; Performance gain; Query processing; Sorting; Spatial databases; Spatial indexes; Stability;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.667097
  • Filename
    667097