Title :
Hash-based symmetric data structure and join algorithm for OLAP applications
Author :
Toyama, Motomichi ; Ohara, Akira
Author_Institution :
Dept. of Inf. & Comput. Eng., Keio Univ., Japan
Abstract :
The star schema is often used in dimensional approaches applied to OLAP applications. The fact table in the star schema typically contains a huge amount of data. When some of the dimension tables are also very large, it may take too much time and storage to join the fact table with these dimension tables. The performance of the join algorithm becomes critical under such a condition. The fluent join is a join algorithm that operates on relations organized as multidimensional linear hash files. Like a merge join on relations which are already sorted on the joining key, its execution reads each page in the operand relations no more than once and does not create intermediate result files. Unlike sorting, the multi-dimensional linear hash can cluster records in several keys symmetrically. In this paper, the concept of the fluent join is applied to an OLAP system to cluster records in each table on the joining keys. As a result, the algorithm yields symmetric performances on joins with different dimension tables
Keywords :
data mining; data structures; database theory; relational databases; software performance evaluation; OLAP applications; dimension tables; fact table; fluent join algorithm; hash-based symmetric data structure; joining key; merge join; multidimensional linear hash files; operand relations; performance; record clustering; star schema; Application software; Clustering algorithms; Computer science; Data engineering; Data structures; Layout; Multidimensional systems; Relational databases; Sorting; Warehousing;
Conference_Titel :
Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7695-0265-2
DOI :
10.1109/IDEAS.1999.787272