Title :
An Improved Method for Mining Generalized Frequent Itemsets Based on the Correlation between Items
Author :
Mao, Yu Xing ; Bai Le Shi
Author_Institution :
Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
Abstract :
Mining generalized association rules is closely related to the taxonomy(is-a hierarchy) data which exists widely in retail, geography, biology and financial domains. If we use traditional method to mine the generalized association rules, it becomes inefficient because the itemsets will be huge along with the items and levels of taxonomy increasing, and it also wastes lots of time to calculate the support of redundant or unnecessary itemsets. In this paper, we proposes a new efficient method called CBP to partition the transaction database into several smaller ones level by level using correlation of itemsets, which make the mining more efficient by reducing the scanning size of transaction database. By experiments on the real-life transaction database, the results show that our CBP_based algorithms outperform the well-known algorithms.
Keywords :
data mining; transaction processing; very large databases; association rule mining; correlation-between-items method; generalized frequent itemset mining; transaction database; Application software; Association rules; Biology computing; Computer science; Credit cards; Data mining; Itemsets; Partitioning algorithms; Taxonomy; Transaction databases;
Conference_Titel :
Computer Science and its Applications, 2008. CSA '08. International Symposium on
Conference_Location :
Hobart, ACT
Print_ISBN :
978-0-7695-3428-2
DOI :
10.1109/CSA.2008.25