Title :
Probabilistic model and optimal reorganization of B+-tree with physical clustering
Author :
Park, June S. ; Sridhar, V.
Author_Institution :
Dept. of Manage. Sci., Iowa Univ., Iowa City, IA, USA
Abstract :
We consider a variant of B+-tree called the B+ -tree with physical clustering (BC), which is exemplified by VSAM key-sequenced datasets. In this file, logically contiguous data buckets are physically clustered in a storage unit called a region in order to enhance the efficiency of sequential processing. As a consequence, records in a bucket are redistributed over time according to two different split algorithms: bucket split and region split. A stochastic model of the disorganization of BC files is constructed using order statistics. Database administrators of online transaction systems often need to reorganize BC files before a region split occurs. An efficient algorithm for determining the optimal reorganization time of BC files is developed. Analytic results are validated by simulation. In passing, we discuss guidelines for load-time configuration of BC files
Keywords :
database theory; optimisation; probability; relational databases; statistical analysis; transaction processing; tree data structures; B+ tree; BC file disorganization; VSAM key-sequenced datasets; bucket split; load-time configuration; logically contiguous data buckets; online transaction systems; optimal reorganization; optimal reorganization time; order statistics; physical clustering; probabilistic model; region split; relational database; sequential processing; simulation; split algorithms; stochastic model; storage unit; Analytical models; Clustering algorithms; Computational modeling; Delay; Distributed computing; Guidelines; Statistics; Stochastic processes; Transaction databases;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on