DocumentCode :
3133947
Title :
Upper bound on the length of generalized disjunction-free patterns
Author :
Kryszkiewicz, Marzena
Author_Institution :
Inst. of Comput. Sci., Warsaw Univ. of Technol., Poland
fYear :
2004
fDate :
21-23 June 2004
Firstpage :
31
Lastpage :
40
Abstract :
A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.
Keywords :
data mining; pattern recognition; a priori-like algorithm; data representation; database scanning; frequent pattern representation; generalized disjunction-free patterns; knowledge discovery; nonderivable itemsets; Association rules; Chromium; Clustering algorithms; Computer science; Conference management; Data mining; Itemsets; Spatial databases; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
ISSN :
1099-3371
Print_ISBN :
0-7695-2146-0
Type :
conf
DOI :
10.1109/SSDM.2004.1311191
Filename :
1311191
Link To Document :
بازگشت