DocumentCode :
2849054
Title :
Scrutinizing Frequent Pattern Discovery Performance
Author :
Zaïane, Osmar R. ; El-Hajj, Mohammad ; Li, Yi ; Luk, Stella
Author_Institution :
Dept. of Comput. Sci., Alberta Univ., Edmonton, Alta., Canada
fYear :
2005
fDate :
05-08 April 2005
Firstpage :
1109
Lastpage :
1110
Abstract :
Benchmarking technical solutions is as important as the solutions themselves. Yet many fields still lack any type of rigorous evaluation. Performance benchmarking has always been an important issue in databases and has played a significant role in the development, deployment and adoption of technologies. To help assessing the myriad algorithms for frequent itemset mining, we built an open framework and testbed to analytically study the performance of different algorithms and their implementations, and contrast their achievements given different data characteristics, different conditions, and different types of patterns to discover and their constraints. This facilitates reporting consistent and reproducible performance results using known conditions.
Keywords :
data mining; pattern recognition; very large databases; frequent itemset mining; frequent pattern discovery performance; myriad algorithm; Algorithm design and analysis; Association rules; Benchmark testing; Clustering algorithms; Data analysis; Data mining; Databases; Itemsets; Pattern analysis; Performance analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN :
1084-4627
Print_ISBN :
0-7695-2285-8
Type :
conf
DOI :
10.1109/ICDE.2005.127
Filename :
1410224
Link To Document :
بازگشت