DocumentCode :
402886
Title :
An equivalence-class-based algorithm for the maximal number of candidate itemsets
Author :
Wang, Yun-Lan ; Li, Zeng-Zhi ; Qu, Ke-Wen
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi'an Jiaotong Univ., China
Volume :
1
fYear :
2003
fDate :
2-5 Nov. 2003
Firstpage :
272
Abstract :
Mining association rules is one of the most important problems in data mining. To devise an algorithm that reduces the number of database scans without risking a combinatorial explosion in the number of candidate itemsets, we must determine the maximal number of candidate itemsets that can be generated. In this paper, the theory of itemset equivalence classes is proposed. The properties of itemset equivalence classes are explored and some useful lemmas are presented. Based on this theory and the Apriori property, we derive several theorems about the maximal number of candidate itemsets. Based on these theorems, we devise an algorithm, EC, for calculating the maximal number of candidate itemsets. Furthermore, the performance study shows that algorithm EC is more accurate than algorithm KK and that the cost of computing the maximal number is negligible compared to the cost of the complete association rule mining algorithm.
Keywords :
data mining; database theory; set theory; candidate itemsets; database scans; equivalence-class-based algorithm; itemset equivalence class; mining association rules; Association rules; Computer science; Costs; Explosions; Itemsets; Partitioning algorithms; Switches; Transaction databases; Upper bound
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
Type :
conf
DOI :
10.1109/ICMLC.2003.1264485
Filename :
1264485