• DocumentCode
    1306546
  • Title

    Efficient bulk-loading of gridfiles

  • Author

    Leutenegger, Scott T. ; Nicol, David M.

  • Author_Institution
    Dept. of Math. & Comput. Sci., Denver Univ., CO, USA
  • Volume
    9
  • Issue
    3
  • fYear
    1997
  • Firstpage
    410
  • Lastpage
    420
  • Abstract
    This paper considers the problem of bulk-loading large data sets for the gridfile multiattribute indexing technique. We propose a rectilinear partitioning algorithm that heuristically seeks to minimize the size of the gridfile needed to ensure no bucket overflows. Empirical studies on both synthetic data sets and on data sets drawn from computational fluid dynamics applications demonstrate that our algorithm is very efficient, and is able to handle large data sets. In addition, we present an algorithm for bulk-loading data sets too large to fit in main memory. Utilizing a sort of the entire data set it creates a gridfile without incurring any overflows
  • Keywords
    data structures; database management systems; file organisation; bucket overflows; computational fluid dynamics; gridfiles; large data sets; multiattribute indexing; rectilinear partitioning; Computational fluid dynamics; Data visualization; Dynamic programming; Heuristic algorithms; Indexing; Information retrieval; Multidimensional systems; Partitioning algorithms; Relational databases; Visual databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.599930
  • Filename
    599930