• DocumentCode
    2837513
  • Title

    An Efficient Matrix Algorithm for Mining Frequent Itemsets

  • Author

    Xu, Zhangyan ; Gu, Dongyuan ; Wei, Song

  • Author_Institution
    Coll. of Comput. Sci. & Inf. Eng., Guangxi Normal Univ., Gulin, China
  • fYear
    2009
  • fDate
    11-13 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. In this paper, we integrate the merits of the matrix algorithm and Index-BitTableFI algorithm, and design an efficient algorithm for mining the frequent itemsets. In the new algorithm, it may be generated directly some frequent itemsets which do not generate in the Index-BitTableFI. At the same time, we do not use recursive method which is time-consuming to compute the other frequent itemsets in Index-BitTableFI algorithm, and use breadth-first search strategy to generate all frequent itemsets. On the other hand, we use the method of the matrix algorithm to compute the supports of the frequent itemsets which do not generate with subsume index technology. Since there are many frequent itemsets which can be generated directly in the new algorithm, the efficiency of the new algorithm is improved. Then an example is used to illustrate the new algorithm. The results of the experiment show that the new algorithm in performance is more remarkable for mining the long and small supports frequent itemsets for sparse datasets and mining frequent itemsets in dense datasets.
  • Keywords
    data mining; matrix algebra; tree searching; Index-BitTableFI algorithm; association rule; breadth-first search strategy; frequent itemsets mining; matrix algorithm; Algorithm design and analysis; Association rules; Clustering algorithms; Data engineering; Data mining; Databases; Educational institutions; Electronic mail; Itemsets; Partitioning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4507-3
  • Electronic_ISBN
    978-1-4244-4507-3
  • Type

    conf

  • DOI
    10.1109/CISE.2009.5364537
  • Filename
    5364537