• DocumentCode
    1220856
  • Title

    Divide-and-approximate: a novel constraint push strategy for iceberg cube mining

  • Author

    Wang, Ke ; Jiang, Yuelong ; Yu, Jeffrey Xu ; Dong, Guozhu ; Han, Jiawei

  • Author_Institution
    Dept. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
  • Volume
    17
  • Issue
    3
  • fYear
    2005
  • fDate
    3/1/2005 12:00:00 AM
  • Firstpage
    354
  • Lastpage
    368
  • Abstract
    The iceberg cube mining computes all cells v, corresponding to GROUP BY partitions, that satisfy a given constraint on aggregated behaviors of the tuples in a GROUP BY partition. The number of cells often is so large that the result cannot be realistically searched without pushing the constraint into the search. Previous works have pushed antimonotone and monotone constraints. However, many useful constraints are neither antimonotone nor monotone. We consider a general class of aggregate constraints of the form f(v)θσ, where f is an arithmetic function of SQL-like aggregates and θ is one of <, ≤, ≥ >. We propose a novel pushing technique, called divide-and-approximate, to push such constraints. The idea is to recursively divide the search space and approximate the given constraint using antimonotone or monotone constraints in subspaces. This technique applies to a class called separable constraints, which properly contains all constraints built by an arithmetic function f of all SQL aggregates.
  • Keywords
    SQL; data integrity; data mining; divide and conquer methods; query processing; relational databases; very large databases; GROUP BY partitions; SQL aggregates; aggregate constraints; arithmetic function; constrained data mining; divide-and-approximate technique; iceberg cube mining; iceberg query; separable constraints; Aggregates; Arithmetic; Data mining; Humans; Prototypes; Subspace constraints;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.45
  • Filename
    1388246