DocumentCode :
2826393
Title :
Non-Almost-Derivable Frequent Itemsets Mining
Author :
Xiaoming, Yang ; Zhibin, Wang ; Bing, Liu ; Shouzhi, Zhang ; Wei, Wang ; Bole, Shi
Author_Institution :
Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
fYear :
2005
fDate :
21-23 Sept. 2005
Firstpage :
157
Lastpage :
161
Abstract :
The number of frequent itemsets is often too large to handle, so it is very necessary to work out a condensed representation of the collection of all frequent itemsets. In this paper, we propose a new condensed representation called frequent non-almost-derivable itemsets. This representation is a subset of the original collection of frequent itemsets. For any removed itemset X (which is called an frequent almost-derivable itemset), we can derive a lower and an upper bound of its support from this representation, and the lower bound and the upper bound is close enough (can be controlled by a user-defined parameter). We also propose an apriori-like algorithm, which can extract all frequent non-derivable itemsets. Extensive empirical results on real datasets show the compactness and good approximation of this representation
Keywords :
data mining; apriori-like algorithm; condensed representation; nonalmost-derivable frequent itemset mining; Association rules; Data mining; Information technology; Itemsets; Transaction databases; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2005. CIT 2005. The Fifth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
0-7695-2432-X
Type :
conf
DOI :
10.1109/CIT.2005.144
Filename :
1562644
Link To Document :
بازگشت