DocumentCode :
2709951
Title :
Performance evaluation of functional disk system with nonuniform data distribution
Author :
Kitsuregawa, Masaru ; Nakano, M. ; Takagi, Maki
Author_Institution :
Inst. of Ind. Sci., Tokyo Univ.
fYear :
1990
fDate :
2-4 Jul 1990
Firstpage :
80
Lastpage :
89
Abstract :
The performance of a functional disk system with relational database engine (FDS-RII) under a nonuniform data distribution is analyzed. FDS-RII is a relational storage system designed to accelerate relational algebraic operations; it uses a hash-based algorithm to process relational operations. In the hash-based algorithm, a relation is first partitioned into several clusters by a split function. Each cluster is then staged into main memory, and a hash function is applied to each cluster to perform the relational operation. Thus, the effect of a nonuniform data distribution on the hash-based algorithm is considered to appear as nonuniformity of the split and hash functions. The effect of hash function nonuniformity can be attenuated by increasing the number of processors and processing the buckets in parallel. To address the nonuniformity of the split function, the combined hash algorithm is introduced. This algorithm combines the GRACE hash algorithm with the nested loop algorithm in order to handle overflowed buckets efficiently. Using the combined hash algorithm, the execution time under a nonuniform data distribution is found to be almost equal to that under a uniform data distribution.
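The two-phase scheme described in the abstract (split into clusters, then hash each cluster in memory, with a nested-loop fallback for a cluster that overflows) can be illustrated roughly as follows. This is only a minimal sketch for an equi-join on in-memory Python lists, not the FDS-RII implementation: the function names, the `capacity` parameter standing in for main-memory size, and the use of Python's built-in `hash` as the split function are all assumptions made for illustration.

```python
from collections import defaultdict


def split(rows, n_clusters):
    """Partition rows into clusters with a split function.

    Assumption: each row is a tuple whose first element is the join key,
    and the split function is simply hash(key) modulo n_clusters.
    """
    clusters = [[] for _ in range(n_clusters)]
    for row in rows:
        clusters[hash(row[0]) % n_clusters].append(row)
    return clusters


def hash_join_cluster(r_cluster, s_cluster):
    """Join one cluster pair by building a hash table on the R side."""
    table = defaultdict(list)
    for r in r_cluster:
        table[r[0]].append(r)
    return [(r, s) for s in s_cluster for r in table.get(s[0], [])]


def nested_loop_cluster(r_cluster, s_cluster, capacity):
    """Fallback for an overflowed cluster: take R in memory-sized chunks
    and scan the S cluster once per chunk (a block nested loop)."""
    out = []
    for start in range(0, len(r_cluster), capacity):
        chunk = r_cluster[start:start + capacity]
        for s in s_cluster:
            for r in chunk:
                if r[0] == s[0]:
                    out.append((r, s))
    return out


def combined_hash_join(r_rows, s_rows, n_clusters=4, capacity=1000):
    """Split both relations, then pick a per-cluster strategy.

    A cluster of R that fits within `capacity` rows is hash-joined;
    a larger (overflowed) cluster falls back to the nested loop.
    """
    r_clusters = split(r_rows, n_clusters)
    s_clusters = split(s_rows, n_clusters)
    result = []
    for r_cluster, s_cluster in zip(r_clusters, s_clusters):
        if len(r_cluster) <= capacity:
            result.extend(hash_join_cluster(r_cluster, s_cluster))
        else:
            result.extend(nested_loop_cluster(r_cluster, s_cluster, capacity))
    return result
```

With skewed input, for example many R rows sharing one key, the cluster holding that key exceeds `capacity` and takes the nested-loop path while the other clusters are still hash-joined. Both paths produce the same set of matching pairs.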
Keywords :
file organisation; magnetic disc storage; relational databases; FDS-RII; execution time; functional disk system; grace hash algorithm; hash function; hash functions; hash-based algorithm; nested loop algorithm; nonuniform data distribution; nonuniformity; overflown bucket; performance evaluation; relational algebraic operations; relational database engine; relational operation; relational storage system; split function; uniform data distribution; Acceleration; Algorithm design and analysis; Buffer storage; Clustering algorithms; Database machines; Engines; Parallel processing; Partitioning algorithms; Performance analysis; Relational databases;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Databases in Parallel and Distributed Systems, 1990, Proceedings. Second International Symposium on
Conference_Location :
Dublin
Print_ISBN :
0-8186-2052-8
Type :
conf
DOI :
10.1109/DPDS.1990.113700
Filename :
113700