DocumentCode
2820386
Title
GPR-Tree: a global parallel index structure for multiattribute declustering on cluster of workstations
Author
Fu, Xiaodong ; Wang, Dingxing ; Zheng, Weimin ; Sheng, Meiming
Author_Institution
Dept. of Comput. Sci., Tsinghua Univ., Beijing, China
fYear
1997
fDate
19-21 Mar 1997
Firstpage
300
Lastpage
306
Abstract
R-tree is a very popular dynamic access structure cable of storing multidimensional and spatial data. Considering it´s merit of the efficient global balance and dynamic reorganization, we try to use R-tree to decluster the multiattribute data in database system or file system. As many previous multiattribute declustering mechanisms do not take into account the properties of the Cluster of Workstations (COW), we present the Global Parallel R-tree (GPR-Tree) under the architecture of COW. Firstly we inspect the issues in efficiency of R-tree and it´s variants, we try to enhance the R-Tree efficiency by using heuristics information in the reconstruction of R-Tree during the node splitting and the treatment of the orphan entries of the underfilled node. Then we parallelize the improved R-Tree among the components in the system. The basic thought is to alleviate the bottleneck effect of the I/O subsystem, making use of the high speed network communication and the memory. The GPR-Tree is shared among the processing units (PU) of the system. We use a mixed LRU algorithm to schedule pages in memory to maintain the nodes visited frequently in memory. A write-update-like protocol is used to keep the coherency among multiple copies maintained in the system. This mechanism is proved efficient to improve the salability and performance of the system
Keywords
file organisation; protocols; spatial data structures; tree data structures; GPR-Tree; cluster of workstations; database system; dynamic access structure; dynamic reorganization; file system; global parallel index structure; high speed network communication; mixed LRU algorithm; multiattribute data; multiattribute declustering; orphan entries; performance; processing units; salability; spatial data; write-update-like protocol; Availability; Clustering algorithms; Communication cables; Costs; Database systems; High-speed networks; Multidimensional systems; Protocols; Scheduling algorithm; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Parallel and Distributed Computing, 1997. Proceedings
Conference_Location
Shanghai
Print_ISBN
0-8186-7876-3
Type
conf
DOI
10.1109/APDC.1997.574047
Filename
574047
Link To Document