• DocumentCode
    802566
  • Title

    New Algorithm for Computing Cube on Very Large Compressed Data Sets

  • Author

    Wu, Weili ; Gao, Hong ; Li, Jianzhong

  • Author_Institution
    Dept. of Comput. Sci. Eng., Texas Univ., Richardson, TX
  • Volume
    18
  • Issue
    12
  • fYear
    2006
  • Firstpage
    1667
  • Lastpage
    1680
  • Abstract
    Data compression is an effective technique to improve the performance of data warehouses. Since cube operation represents the core of online analytical processing in data warehouses, it is a major challenge to develop efficient algorithms for computing cube on compressed data warehouses. To our knowledge, very few cube computation techniques have been proposed for compressed data warehouses to date in the literature. This paper presents a novel algorithm to compute cubes on compressed data warehouses. The algorithm operates directly on compressed data sets without the need of first decompressing them. The algorithm is applicable to a large class of mapping complete data compression methods. The complexity of the algorithm is analyzed in detail. The analytical and experimental results show that the algorithm is more efficient than all other existing cube algorithms. In addition, a heuristic algorithm to generate an optimal plan for computing cube is also proposed
  • Keywords
    computational complexity; data compression; data mining; data warehouses; algorithm complexity; compressed data warehouses; cube computation techniques; data compression; heuristic algorithm; online analytical processing; very large compressed data sets; Algorithm design and analysis; Computer applications; Costs; Data analysis; Data compression; Data warehouses; Databases; Decision making; Heuristic algorithms; Multidimensional systems; Data warehouses; OLAP.; cube operation; data compression;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.195
  • Filename
    1717423