• DocumentCode
    3249103
  • Title

    Mining top-k frequent closed patterns without minimum support

  • Author

    Han, Jiawei ; Wang, Jianyong ; Lu, Ying ; Tzvetkov, Petre

  • Author_Institution
    Illinois Univ., Urbana, IL, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    211
  • Lastpage
    218
  • Abstract
    In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min_ℓ, where k is the desired number of frequent closed patterns to be mined, and min_ℓ is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed-node-count and descendant-sum are proposed to effectively raise support threshold and prune FP-tree both during and after the construction of FP-tree. During the mining process, a novel top-down and bottom-up combined FP-tree mining strategy is developed to speed-up support-raising and closed frequent pattern discovering. In addition, a fast hash-based closed pattern verification scheme has been employed to check efficiently if a potential closed pattern is really closed. Our performance study shows that in most cases, TFP outperforms CLOSET and CHARM, two efficient frequent closed pattern mining algorithms, even when both are running with the best tuned min-support. Furthermore, the method can be extended to generate association rules and to incorporate user-specified constraints.
  • Keywords
    data mining; trees (mathematics); FP-tree pruning; TFP; closed frequent pattern discovery; closed-node-count method; descendant-sum method; efficient algorithm; fast hash-based closed pattern verification scheme; minimum support; support threshold; support-raising; top-k frequent closed pattern mining; Association rules; Data mining; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183905
  • Filename
    1183905