• DocumentCode
    2826393
  • Title

    Non-Almost-Derivable Frequent Itemsets Mining

  • Author

    Xiaoming, Yang ; Zhibin, Wang ; Bing, Liu ; Shouzhi, Zhang ; Wei, Wang ; Bole, Shi

  • Author_Institution
    Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
  • fYear
    2005
  • fDate
    21-23 Sept. 2005
  • Firstpage
    157
  • Lastpage
    161
  • Abstract
    The number of frequent itemsets is often too large to handle, so it is very necessary to work out a condensed representation of the collection of all frequent itemsets. In this paper, we propose a new condensed representation called frequent non-almost-derivable itemsets. This representation is a subset of the original collection of frequent itemsets. For any removed itemset X (which is called an frequent almost-derivable itemset), we can derive a lower and an upper bound of its support from this representation, and the lower bound and the upper bound is close enough (can be controlled by a user-defined parameter). We also propose an apriori-like algorithm, which can extract all frequent non-derivable itemsets. Extensive empirical results on real datasets show the compactness and good approximation of this representation
  • Keywords
    data mining; apriori-like algorithm; condensed representation; nonalmost-derivable frequent itemset mining; Association rules; Data mining; Information technology; Itemsets; Transaction databases; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology, 2005. CIT 2005. The Fifth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    0-7695-2432-X
  • Type

    conf

  • DOI
    10.1109/CIT.2005.144
  • Filename
    1562644