DocumentCode
2403843
Title
Efficient OLAP query processing in distributed data warehouses
Author
Akinde, Michael ; Bohlen, Michael ; Johnson, Theodore ; Lakshmanan, L.V. ; Srivastava, Divesh
Author_Institution
Aalborg Univ., Denmark
fYear
2002
fDate
2002
Firstpage
262
Abstract
The success of Internet applications has led to an explosive growth in the demand for bandwidth from ISPs. Managing an IP network includes complex data analysis that can often be expressed as OLAP queries. Current day OLAP tools assume the availability of the detailed data in a centralized warehouse. However, the inherently distributed nature of the data collection (e.g., flow-level traffic statistics are gathered at network routers) and the huge amount of data extracted at each collection point (of the order of several gigabytes per day for large IP networks) makes such an approach highly impractical. The natural solution to this problem is to maintain a distributed data warehouse, consisting of multiple local data warehouses (sites) adjacent to the collection points, together with a coordinator. In order for such a solution to make sense, we need a technology for distributed processing of complex OLAP queries. We have developed the Skalla system for this task. We conducted an experimental study of the Skalla evaluation scheme using TPC(R) data
Keywords
Internet; data mining; data warehouses; distributed databases; query processing; relational databases; IP network; OLAP; Skalla system; data analysis; distributed data warehouse; experimental study; optimization; query processing; relational algebra; relational database; Bandwidth; Data analysis; Data mining; Data warehouses; Explosives; IP networks; Internet; Query processing; Statistical distributions; Telecommunication traffic;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2002. Proceedings. 18th International Conference on
Conference_Location
San Jose, CA
ISSN
1063-6382
Print_ISBN
0-7695-1531-2
Type
conf
DOI
10.1109/ICDE.2002.994716
Filename
994716
Link To Document