• DocumentCode
    745210
  • Title

    Efficient aggregation algorithms for compressed data warehouses

  • Author

    Li, Janzhong ; Srivastava, Jaideep

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Harbin Inst. of Technol., China
  • Volume
    14
  • Issue
    3
  • fYear
    2002
  • Firstpage
    515
  • Lastpage
    529
  • Abstract
    Aggregation and cube are important operations for online analytical processing (OLAP). Many efficient algorithms to compute aggregation and cube for relational OLAP have been developed. Some work has been done on efficiently computing cube for multidimensional data warehouses that store data sets in multidimensional arrays rather than in tables. However, to our knowledge, there is nothing to date in the literature describing aggregation algorithms on compressed data warehouses for multidimensional OLAP. This paper presents a set of aggregation algorithms on compressed data warehouses for multidimensional OLAP. These algorithms operate directly on compressed data sets, which are compressed by the mapping-complete compression methods, without the need to first decompress them. The algorithms have different performance behaviors as a function of the data set parameters, sizes of outputs and main memory availability. The algorithms are described and the I/O and CPU cost functions are presented in this paper. A decision procedure to select the most efficient algorithm for a given aggregation request is also proposed. The analysis and experimental results show that the algorithms have better performance on sparse data than the previous aggregation algorithms
  • Keywords
    data compression; data mining; data warehouses; software performance evaluation; CPU cost functions; OLAP; aggregation algorithms; compressed data warehouses; cube; data mining; experimental results; main memory availability; multidimensional OLAP; multidimensional arrays; multidimensional data warehouses; online analytical processing; performance; relational database; Data warehouses;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2002.1000340
  • Filename
    1000340