Title :
Improving OLAP performance by multidimensional hierarchical clustering
Author :
Markl, Volker ; Ramsak, Frank ; Bayer, Rudolf
Author_Institution :
Bayerisches Forschungszentrum fur Wissensbasierte Syst., Munchen, Germany
Abstract :
Data warehousing applications cope with enormous data sets in the range of Gigabytes and Terabytes. Queries usually either select a very small set of this data or perform aggregations on a fairly large data set. Materialized views storing pre-computed aggregates are used to efficiently process queries with aggregations. This approach increases resource requirements in disk space and slows down updates because of the view maintenance problem. Multidimensional hierarchical clustering (MHC) of OLAP data overcomes these problems while offering more flexibility for aggregation paths. Clustering is introduced as a way to speed up aggregation queries without additional storage cost for materialization. Performance and storage cost of our access method are investigated and compared to current query processing scenarios. In addition performance measurements on real world data for a typical star schema are presented
Keywords :
data mining; data warehouses; distributed databases; pattern clustering; query processing; OLAP data; OLAP performance; access method; aggregation paths; aggregation queries; data warehousing applications; disk space; large data set; materialized views; multidimensional hierarchical clustering; performance measurements; pre-computed aggregates; query processing scenarios; real world data; resource requirements; star schema; storage cost; view maintenance problem; Aggregates; Costs; Data models; Data processing; Database languages; Delay; Material storage; Multidimensional systems; Query processing; Warehousing;
Conference_Titel :
Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7695-0265-2
DOI :
10.1109/IDEAS.1999.787265