Title :
On solving the view selection problem in distributed data warehouse architectures
Author :
Bauer, Andreas ; Lehner, Wolfgang
Author_Institution :
T-Syst. Nova GmbH, Nuremberg, Germany
Abstract :
The use of materialized views in a data warehouse installation is a common way to speed up queries, mostly aggregation queries. The problems that come with materialized aggregate views have triggered a wide variety of proposals, such as picking the optimal set of aggregation combinations, transparently rewriting user queries to take advantage of the summary data, or synchronizing pre-computed summary data as soon as the base data changes. The paper focuses on the problem of view selection in the context of distributed data warehouse architectures. While much research has been done on the view selection problem in the central case, we are not aware of any other work discussing the problem of view selection in distributed data warehouse systems. The paper proposes an extension of the concept of an aggregation lattice to capture the distributed semantics. Moreover, we extend a greedy selection algorithm, based on an adequate cost model, to the distributed case. In a performance study, we finally compare our findings with the approach of applying a selection algorithm locally to each node in a distributed warehouse environment.
Keywords :
data warehouses; distributed databases; query processing; data analysis; data information; data installation; data warehouse; distributed data warehouse architectures; distributed semantic; greedy algorithm; selection algorithm; summary data; Aggregates; Conference management; Costs; Data warehouses; Databases; Lattices; Material storage; Space technology; Technology management; Time factors;
Conference_Title :
Scientific and Statistical Database Management, 2003. 15th International Conference on
Print_ISBN :
0-7695-1964-4
DOI :
10.1109/SSDM.2003.1214953