• DocumentCode
    1800924
  • Title

    IMBT--A Binary Tree for Efficient Support Counting of Incremental Data Mining

  • Author

    Yang, Chia-Han ; Yang, Don-Lin

  • Author_Institution
    Dept. of Inf. Eng. & Comput. Sci., Feng Chia Univ., Taichung, Taiwan
  • Volume
    1
  • fYear
    2009
  • fDate
    29-31 Aug. 2009
  • Firstpage
    324
  • Lastpage
    329
  • Abstract
    In the real world application, databases are updated continually. Most data mining approaches face the efficiency problem of repeating the mining process when the database is updated. Therefore, developing efficient approaches of incremental data mining is a critical issue for the real world data mining application. If we could use the previous analysis to incrementally mine the frequent itemsets from the updated database, the cost would be minimized. In this research, we propose a novel mining method with a data structure called IMBT (incremental mining binary tree) which is used to record the itemsets in an efficient way. Furthermore, our approach needs not to predetermine the minimum support threshold and scans the database only once. The results of our research indicate that our method not only performs incremental data mining more efficiently, but also finds frequent itemsets faster than the Apriori and FP-growth algorithms.
  • Keywords
    data mining; tree data structures; Apriori algorithm; FP-growth algorithm; binary tree; cost minimisation; data structure; database updation; frequent itemset mining; incremental data mining; minimum support threshold; support count; Application software; Binary trees; Computer science; Costs; Data engineering; Data mining; Data structures; Itemsets; Transaction databases; Tree data structures; Incremental data mining; binary tree; frequent itemset; support count;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Science and Engineering, 2009. CSE '09. International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4244-5334-4
  • Electronic_ISBN
    978-0-7695-3823-5
  • Type

    conf

  • DOI
    10.1109/CSE.2009.360
  • Filename
    5283173