• DocumentCode
    2335323
  • Title

    LPMiner: an algorithm for finding frequent itemsets using length-decreasing support constraint

  • Author

    Seno, Masakazu ; Karypis, George

  • Author_Institution
    Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    505
  • Lastpage
    512
  • Abstract
    Over the years, a variety of algorithms for finding frequent item sets in very large transaction databases has been developed. The key feature in most of these algorithms is that they use a constant support constraint to control the inherently exponential complexity of the problem. In general, item sets that contain only a few items tend to be interesting if they have a high support, whereas long item sets can still be interesting even if their support is relatively small. Ideally, we desire to have an algorithm that finds all the frequent item sets whose support decreases as a function of their length. In this paper, we present an algorithm called LPMiner (Long Pattern Miner) that finds all item sets that satisfy a length-decreasing support constraint. Our experimental evaluation shows that LPMiner is up to two orders of magnitude faster than the FP-growth algorithm for finding item sets at a constant support constraint, and that its run-time increases gradually as the average length of the transactions (and the discovered item sets) increases
  • Keywords
    computational complexity; data mining; pattern recognition; transaction processing; tree data structures; very large databases; FP-growth algorithm; FP-trees; LPMiner algorithm; constant support constraint; exponential complexity; frequent item-set discovery; length-decreasing support constraint; long item sets; run time; smallest valid extension; transaction length; very large transaction databases; Association rules; Computer science; Contracts; Data engineering; Data mining; High performance computing; Itemsets; Runtime; Transaction databases; US Department of Energy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    0-7695-1119-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2001.989558
  • Filename
    989558