DocumentCode
3133947
Title
Upper bound on the length of generalized disjunction-free patterns
Author
Kryszkiewicz, Marzena
Author_Institution
Inst. of Comput. Sci., Warsaw Univ. of Technol., Poland
fYear
2004
fDate
21-23 June 2004
Firstpage
31
Lastpage
40
Abstract
A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.
Keywords
data mining; pattern recognition; a priori-like algorithm; data representation; database scanning; frequent pattern representation; generalized disjunction-free patterns; knowledge discovery; nonderivable itemsets; Association rules; Chromium; Clustering algorithms; Computer science; Conference management; Data mining; Itemsets; Spatial databases; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
ISSN
1099-3371
Print_ISBN
0-7695-2146-0
Type
conf
DOI
10.1109/SSDM.2004.1311191
Filename
1311191
Link To Document