مرکز منطقه ای اطلاع رساني علوم و فناوري - On the complexity of two frequent set generation algorithms

DocumentCode :

2373500

Title :

On the complexity of two frequent set generation algorithms

Author :

Sprague, A.P.

fYear :

2004

fDate :

16-18 Dec. 2004

Firstpage :

344

Lastpage :

350

Abstract :

Frequent set generation is normally the first step toward association rule generation, and is usually considered the bottleneck step. Accordingly, many algorithms to generate frequent sets have been proposed. Evaluation of these algorithms is normally done empirically: their running times have been compared on a few or a substantial number of data files. Evaluation of the computational complexity of these algorithms has not much been pursued except to note that every algorithm requires exponential time, because the output size can be exponential in terms qf´ the input size. We develop an alternate method qf´ evaluating algorithms for frequent set generation: we propose evaluating the algorithms according to their computational complexity on several abstractly defined infinite families qf´ data files. We perform this evaluation for Apriori and F P-growth on two infinite families qf´data files, and show that on both families, a variant of F P-growth is unboundedly faster than Apriori as the data file size increases.

Keywords :

Algorithm design and analysis; Association rules; Benchmark testing; Computational complexity; Data mining; Extrapolation; Itemsets;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Applications, 2004. Proceedings. 2004 International Conference on

Conference_Location :

Louisville, Kentucky, USA

Print_ISBN :

0-7803-8823-2

Type :

conf

DOI :

10.1109/ICMLA.2004.1383533

Filename :

1383533

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2373500