• DocumentCode
    2580621
  • Title

    FCompress: a new technique for queriable compression of facts and datacubes

  • Author

    Furtado, Pedro ; Madeira, H.

  • Author_Institution
    Coimbra Univ., Portugal
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    197
  • Lastpage
    206
  • Abstract
    Decision support applications must analyze information from data warehouses efficiently. For this reason, huge data warehouses must have mechanisms to cope with massive amounts of data. Reducing and compressing fact tables, summary tables and data cubes is important for faster operation and smaller storage overhead. Traditional compression techniques are not useful in this context except for archiving, because they render the data unqueriable. Although data reduction techniques are useful for fast approximate answers to complex queries, their accuracy is not enough to replace the base data. We present FCompress, a new fact compression technique that effectively replaces the base data, compressing it while maintaining queriability. The approach is based on the premise that a very small and adjustable error is acceptable in many fact attributes. The technique is applicable to fact and summary tables and data cubes alike. It has been evaluated, showing that very small errors can be achieved for point reconstruction (typically below 2%) while the original fact table is reduced to about 35% to 60% of its size and the data cube is reduced to about 15% to 30% of the size. The error is even smaller for typical OLAP queries, usually less than 1%, depending on the degree of aggregation
  • Keywords
    data compression; data mining; data structures; data warehouses; decision support systems; query processing; FCompress; OLAP; aggregation; data compression; data cubes; data reduction; data warehouses; decision support applications; error; fact tables; queries; storage overhead; summary tables; Data warehouses; Decision support systems; Degradation; Lattices; Levee; Multidimensional systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications Symposium, 2000 International
  • Conference_Location
    Yokohama
  • Print_ISBN
    0-7695-0789-1
  • Type

    conf

  • DOI
    10.1109/IDEAS.2000.880578
  • Filename
    880578