Title :
A new approach to online generation of association rules
Author :
Aggarwal, Charu C. ; Yu, Philip S.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
We discuss the problem of online mining of association rules in a large database of sales transactions. The online mining is performed by preprocessing the data effectively in order to make it suitable for repeated online queries. We store the preprocessed data in such a way that online processing may be done by applying a graph theoretic search algorithm whose complexity is proportional to the size of the output. The result is an online algorithm which is independent of the size of the transactional data and the size of the preprocessed data. The algorithm is almost instantaneous in the size of the output. The algorithm also supports techniques for quickly discovering association rules from large itemsets. The algorithm is capable of finding rules with specific items in the antecedent or consequent. These association rules are presented in a compact form, eliminating redundancy. The use of nonredundant association rules helps significantly in the reduction of irrelevant noise in the data mining process
Keywords :
data mining; marketing data processing; very large databases; antecedent; complexity; consequent; data mining; graph theoretic search algorithm; large database; large itemset; noise reduction; nonredundant association rules; online association rule generation; online mining; preprocessing; repeated online queries; sales transactions; Association rules; Data mining; Itemsets; Marketing and sales; Noise reduction; Promotion - marketing; Transaction databases;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on