• DocumentCode
    2459244
  • Title

    Rule generation with the pattern repository

  • Author

    Relue, Richard ; Wu, Xindong

  • Author_Institution
    Dept. of Math. & Comput. Sci., Colorado Sch. of Mines, Golden, CO, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    186
  • Lastpage
    191
  • Abstract
    Efficient algorithms for mining frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed in 1994, there have been several methods developed to improve its performance. However, most still adopt its candidate set generation-and-test approach. In addition, many methods do not generate all frequent patterns, making them inadequate to derive all association rules. The calculation of association rules from raw itemsets using Apriori is an intractable problem. By using a new structure called a pattern repository, the same rules can be derived in linear-time proportional to the number of unique items found. If the selected rules are all we need, the calculation can give results in real-time. In addition, the calculation can easily be divided into subsets for distributed processing and large datasets can be stored on disk that adds to the I/O overhead, but still offers a linear time calculation of rules.
  • Keywords
    computational complexity; data mining; pattern recognition; tree searching; very large databases; Apriori algorithm; association rules; candidate set generation-and-test approach; data mining; distributed processing; frequent pattern mining; large datasets; linear time rule calculation; pattern repository; real-time; rule generation; search; time complexity; Data mining; Itemsets; Optimized production technology; Read only memory; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence Systems, 2002. (ICAIS 2002). 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1733-1
  • Type

    conf

  • DOI
    10.1109/ICAIS.2002.1048085
  • Filename
    1048085