Title :
Upper bound on the length of generalized disjunction-free patterns
Author :
Kryszkiewicz, Marzena
Author_Institution :
Inst. of Comput. Sci., Warsaw Univ. of Technol., Poland
Abstract :
A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.
Keywords :
data mining; pattern recognition; a priori-like algorithm; data representation; database scanning; frequent pattern representation; generalized disjunction-free patterns; knowledge discovery; nonderivable itemsets; Association rules; Chromium; Clustering algorithms; Computer science; Conference management; Data mining; Itemsets; Spatial databases; Upper bound;
Conference_Titel :
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-2146-0
DOI :
10.1109/SSDM.2004.1311191