DocumentCode :
2748666
Title :
Design and implementation of a scalable parallel system for multidimensional analysis and OLAP
Author :
Goil, Sanjay ; Choudhary, Alok
Author_Institution :
Dept. of Electr. & Comput. Eng., Northwestern Univ., Evanston, IL, USA
fYear :
1999
fDate :
12-16 Apr 1999
Firstpage :
576
Lastpage :
581
Abstract :
Multidimensional Analysis and On-Line Analytical Processing (OLAP) uses summary information that requires aggregate operations along one or more dimensions of numerical data values. Query processing for these applications require different views of data for decision support. The Data Cube operator provides multi-dimensional aggregates, used to calculate and store summary information on a number of dimensions. The multi-dimensionality of the underlying problem can be represented both in relational and multi-dimensional databases, the latter being a better fit when query performance is the criteria for judgment. Relational databases are scalable in size and efforts are on to make their performance acceptable. On the other hand multi-dimensional databases perform well for such queries, although they are nor very scalable. Parallel computing is necessary to address the scalability and performance issues for these data sets. In this paper we present a parallel and scalable infrastructure for OLAP and multidimensional analysis. We use chunking to store data either as a dense block using multidimensional arrays (md-arrays) or a sparse set using a Bit encoded sparse structure (BESS). Chunks provide a multidimensional index structure for efficient dimension oriented data accesses much the same as md-arrays do. Operations within chunks and between chunks are a combination of relational and multi-dimensional operations depending on whether the chunk is sparse or dense. We present performance results for data sets with 3, 5 and 10 dimensions for our implementation on the IBM SP-2 which show good speedup and scalability
Keywords :
decision support systems; parallel programming; query processing; relational databases; very large databases; Bit encoded sparse structure; OLAP; On-Line Analytical Processing; chunking; decision support; decision support systems; large databases; multidimensional analysis; performance results; query performance; query processing; relational databases; scalable parallel system; Aggregates; Data mining; Data structures; Databases; Decision support systems; Information analysis; Multidimensional systems; Performance analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 1999. 13th International and 10th Symposium on Parallel and Distributed Processing, 1999. 1999 IPPS/SPDP. Proceedings
Conference_Location :
San Juan
Print_ISBN :
0-7695-0143-5
Type :
conf
DOI :
10.1109/IPPS.1999.760535
Filename :
760535
Link To Document :
بازگشت