DocumentCode
2459244
Title
Rule generation with the pattern repository
Author
Relue, Richard ; Wu, Xindong
Author_Institution
Dept. of Math. & Comput. Sci., Colorado Sch. of Mines, Golden, CO, USA
fYear
2002
fDate
2002
Firstpage
186
Lastpage
191
Abstract
Efficient algorithms for mining frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed in 1994, there have been several methods developed to improve its performance. However, most still adopt its candidate set generation-and-test approach. In addition, many methods do not generate all frequent patterns, making them inadequate to derive all association rules. The calculation of association rules from raw itemsets using Apriori is an intractable problem. By using a new structure called a pattern repository, the same rules can be derived in linear-time proportional to the number of unique items found. If the selected rules are all we need, the calculation can give results in real-time. In addition, the calculation can easily be divided into subsets for distributed processing and large datasets can be stored on disk that adds to the I/O overhead, but still offers a linear time calculation of rules.
Keywords
computational complexity; data mining; pattern recognition; tree searching; very large databases; Apriori algorithm; association rules; candidate set generation-and-test approach; data mining; distributed processing; frequent pattern mining; large datasets; linear time rule calculation; pattern repository; real-time; rule generation; search; time complexity; Data mining; Itemsets; Optimized production technology; Read only memory; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence Systems, 2002. (ICAIS 2002). 2002 IEEE International Conference on
Print_ISBN
0-7695-1733-1
Type
conf
DOI
10.1109/ICAIS.2002.1048085
Filename
1048085
Link To Document