• Title of article

    Fast mining erasable itemsets using NC_sets

  • Author/Authors

    Deng, Zhihong and Xu, Xiao-Ran

  • Issue Information
    Journal with serial number, year 2012
  • Pages
    11
  • From page
    4453
  • To page
    4463
  • Abstract
    Mining erasable itemsets, first introduced in 2009, is one of the newly emerging data mining tasks. In this paper, we present a new data representation called the NC_set, which keeps track of the complete information used for mining erasable itemsets. Based on the NC_set, we propose a new algorithm called MERIT for mining erasable itemsets efficiently. The efficiency of MERIT is achieved with three techniques. First, the NC_set is a compact structure that prunes irrelevant data automatically. Second, computing the gain of an itemset is transformed into combining NC_sets, which can be completed in linear time by an ingenious strategy. Third, in some cases MERIT can find erasable itemsets directly, without generating candidate itemsets. To evaluate MERIT, we conducted extensive experiments on many synthetic product databases. Our performance study shows that MERIT is efficient and is on average about two orders of magnitude faster than META, the first algorithm for mining erasable itemsets.
  • Keywords
    Algorithms, Data structure, Data mining, NC_sets, Erasable itemsets
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2012
  • Record number

    2351473