DocumentCode :
2459244
Title :
Rule generation with the pattern repository
Author :
Relue, Richard ; Wu, Xindong
Author_Institution :
Dept. of Math. & Comput. Sci., Colorado Sch. of Mines, Golden, CO, USA
fYear :
2002
fDate :
2002
Firstpage :
186
Lastpage :
191
Abstract :
Efficient algorithms for mining frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed in 1994, there have been several methods developed to improve its performance. However, most still adopt its candidate set generation-and-test approach. In addition, many methods do not generate all frequent patterns, making them inadequate to derive all association rules. The calculation of association rules from raw itemsets using Apriori is an intractable problem. By using a new structure called a pattern repository, the same rules can be derived in linear-time proportional to the number of unique items found. If the selected rules are all we need, the calculation can give results in real-time. In addition, the calculation can easily be divided into subsets for distributed processing and large datasets can be stored on disk that adds to the I/O overhead, but still offers a linear time calculation of rules.
Keywords :
computational complexity; data mining; pattern recognition; tree searching; very large databases; Apriori algorithm; association rules; candidate set generation-and-test approach; data mining; distributed processing; frequent pattern mining; large datasets; linear time rule calculation; pattern repository; real-time; rule generation; search; time complexity; Data mining; Itemsets; Optimized production technology; Read only memory; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence Systems, 2002. (ICAIS 2002). 2002 IEEE International Conference on
Print_ISBN :
0-7695-1733-1
Type :
conf
DOI :
10.1109/ICAIS.2002.1048085
Filename :
1048085
Link To Document :
بازگشت