• DocumentCode
    3133947
  • Title

    Upper bound on the length of generalized disjunction-free patterns

  • Author

    Kryszkiewicz, Marzena

  • Author_Institution
    Inst. of Comput. Sci., Warsaw Univ. of Technol., Poland
  • fYear
    2004
  • fDate
    21-23 June 2004
  • Firstpage
    31
  • Lastpage
    40
  • Abstract
    A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.
  • Keywords
    data mining; pattern recognition; a priori-like algorithm; data representation; database scanning; frequent pattern representation; generalized disjunction-free patterns; knowledge discovery; nonderivable itemsets; Association rules; Chromium; Clustering algorithms; Computer science; Conference management; Data mining; Itemsets; Spatial databases; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-2146-0
  • Type

    conf

  • DOI
    10.1109/SSDM.2004.1311191
  • Filename
    1311191