• DocumentCode
    1689767
  • Title

    Declustering large multidimensional data sets for range queries over heterogeneous disks

  • Author

    Lee, Jonghyun ; Winslett, Marianne ; Ma, Xiaosong ; Yu, Shengke

  • Author_Institution
    Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
  • fYear
    2003
  • Firstpage
    212
  • Lastpage
    221
  • Abstract
    Declustering is a technique to distribute data sets over multiple disks so that future retrievals can be well balanced over the disks and be performed in parallel. Although clusters often have heterogeneous disks, most declustering work has focused only on homogeneous environments. In this work, we investigate the declustering problem for a heterogeneous disk environment using virtual servers, and propose approaches for deciding the number of virtual servers and the mapping between virtual servers and physical disks. Our experimental results show that by combining our algorithm for choosing the number of virtual servers with a greedy algorithm for mapping virtual servers to disks, users can expect range query retrieval performance within 4% of the optimum achievable in practice on average, in all configurations studied. Compared to an intuitively natural approach to the problem, this represents an improvement of 8-31% in average fetch ratio, as well a 26-38% reduction in the standard deviation of performance for small queries.
  • Keywords
    data handling; data models; database management systems; disc storage; distributed processing; storage allocation; data set declustering; greedy algorithm; heterogeneous disk; large multidimensional data set; multiple disks; range query; virtual server; Aggregates; Bandwidth; Computer science; Data visualization; Databases; Greedy algorithms; Image retrieval; Information retrieval; Multidimensional systems; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2003. 15th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-1964-4
  • Type

    conf

  • DOI
    10.1109/SSDM.2003.1214982
  • Filename
    1214982