DocumentCode :
1055826
Title :
Performance analysis of a concurrent file reorganization algorithm for record clustering
Author :
Omiecinski, Edward ; Lee, Liehuey ; Scheuermann, Peter
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
6
Issue :
2
fYear :
1994
fDate :
4/1/1994 12:00:00 AM
Firstpage :
248
Lastpage :
257
Abstract :
Presents a simulation-based performance analysis of a concurrent file reorganization algorithm. We examine the effect on throughput of (a) buffer size, (b) degree of reorganization, (c) write probability of transactions, (d) multiprogramming level, and (e) degree of clustered transactions. The problem of file reorganization that we consider involves altering the placement of records on pages of a secondary storage device. In addition, we want this reorganization to be done in place, i.e. using the file´s original storage space for the newly reorganized file. Our approach is appropriate for a non-in-place reorganization as well. The motivation for such a physical change, i.e. record clustering, is to improve the database system´s performance, i.e. minimizing the number of page accesses made in answering a set of queries. There are numerous record clustering algorithms, but they usually do not solve the entire problem, i.e., they do not specify how to efficiently reorganize the file to reflect the clustering assignment that they determine. In previous work, we have presented an algorithm that is a companion to general record clustering algorithms, i.e. it actually transforms the file. In this work we show through simulation that our algorithm, when run concurrently with user transactions, provides an acceptable level of overall database system performance
Keywords :
file organisation; multiprogramming; parallel algorithms; performance evaluation; buffer size; clustered transactions; clustering assignment; concurrent file reorganization algorithm; database system performance; multiprogramming level; page accesses; query answering; record clustering; record placement; secondary storage device; simulation-based performance analysis; throughput; transaction write probability; Analytical models; Clustering algorithms; Computer science; Costs; Database systems; Indexes; Performance analysis; Relational databases; Throughput; Writing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.277769
Filename :
277769
Link To Document :
بازگشت