DocumentCode :
2373500
Title :
On the complexity of two frequent set generation algorithms
Author :
Sprague, A.P.
fYear :
2004
fDate :
16-18 Dec. 2004
Firstpage :
344
Lastpage :
350
Abstract :
Frequent set generation is normally the first step toward association rule generation, and is usually considered the bottleneck step. Accordingly, many algorithms to generate frequent sets have been proposed. Evaluation of these algorithms is normally done empirically: their running times have been compared on a few or a substantial number of data files. Evaluation of the computational complexity of these algorithms has not much been pursued except to note that every algorithm requires exponential time, because the output size can be exponential in terms qf´ the input size. We develop an alternate method qf´ evaluating algorithms for frequent set generation: we propose evaluating the algorithms according to their computational complexity on several abstractly defined infinite families qf´ data files. We perform this evaluation for Apriori and F P-growth on two infinite families qf´data files, and show that on both families, a variant of F P-growth is unboundedly faster than Apriori as the data file size increases.
Keywords :
Algorithm design and analysis; Association rules; Benchmark testing; Computational complexity; Data mining; Extrapolation; Itemsets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2004. Proceedings. 2004 International Conference on
Conference_Location :
Louisville, Kentucky, USA
Print_ISBN :
0-7803-8823-2
Type :
conf
DOI :
10.1109/ICMLA.2004.1383533
Filename :
1383533
Link To Document :
بازگشت