Title :
IMBT--A Binary Tree for Efficient Support Counting of Incremental Data Mining
Author :
Yang, Chia-Han ; Yang, Don-Lin
Author_Institution :
Dept. of Inf. Eng. & Comput. Sci., Feng Chia Univ., Taichung, Taiwan
Abstract :
In the real world application, databases are updated continually. Most data mining approaches face the efficiency problem of repeating the mining process when the database is updated. Therefore, developing efficient approaches of incremental data mining is a critical issue for the real world data mining application. If we could use the previous analysis to incrementally mine the frequent itemsets from the updated database, the cost would be minimized. In this research, we propose a novel mining method with a data structure called IMBT (incremental mining binary tree) which is used to record the itemsets in an efficient way. Furthermore, our approach needs not to predetermine the minimum support threshold and scans the database only once. The results of our research indicate that our method not only performs incremental data mining more efficiently, but also finds frequent itemsets faster than the Apriori and FP-growth algorithms.
Keywords :
data mining; tree data structures; Apriori algorithm; FP-growth algorithm; binary tree; cost minimisation; data structure; database updation; frequent itemset mining; incremental data mining; minimum support threshold; support count; Application software; Binary trees; Computer science; Costs; Data engineering; Data mining; Data structures; Itemsets; Transaction databases; Tree data structures; Incremental data mining; binary tree; frequent itemset; support count;
Conference_Titel :
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-5334-4
Electronic_ISBN :
978-0-7695-3823-5
DOI :
10.1109/CSE.2009.360