Title :
A similarity graph-based approach to declustering problems and its application towards parallelizing grid files
Author :
Liu, Duen-Ren ; Shekhar, Shashi
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Abstract :
We propose a new similarity-based technique for declustering data. The proposed method can adapt to available information about query distributions, data distributions, data sizes and partition-size constraints. The method is based on max-cut partitioning of a similarity graph defined over the given set of data, under constraints on the partition sizes. It maximizes the chances that a pair of data-items that are to be accessed together by queries are allocated to distinct disks. We show that the proposed method can achieve optimal speed-up for a query-set, if there exists any other declustering method which will achieve the optimal speed-up. Experiments in parallelizing grid files show that the proposed method outperforms mapping-function-based methods for interesting query distributions as well for non-uniform data distributions
Keywords :
data handling; graph theory; input-output programs; query processing; storage allocation; storage management; data declustering problems; data distributions; data sizes; data-item pair; distinct disks; max-cut partitioning; nonuniform data distributions; optimal speed-up; parallelizing grid files; partition size constraints; query distributions; query-set; similarity graph-based approach; Application software; Computer science; Concurrent computing; Databases; Delay; Error correction codes; Frequency; Lattices; Polynomials; Round robin;
Conference_Titel :
Data Engineering, 1995. Proceedings of the Eleventh International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-8186-6910-1
DOI :
10.1109/ICDE.1995.380370