DocumentCode :
1559465
Title :
Mining optimized association rules with categorical and numeric attributes
Author :
Rastogi, Rajeev ; Shim, Kyuseok
Author_Institution :
Lucent Technol. Bell Labs., Murray Hill, NJ, USA
Volume :
14
Issue :
1
fYear :
2002
Firstpage :
29
Lastpage :
50
Abstract :
Mining association rules on large data sets has received considerable attention in recent years. Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial, and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support or confidence of the rule is maximized. In this paper, we generalize the optimized association rules problem in three ways: (1) association rules are allowed to contain disjunctions over uninstantiated attributes, (2) association rules are permitted to contain an arbitrary number of uninstantiated attributes, and (3) uninstantiated attributes can be either categorical or numeric. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving multiple attributes. We present effective techniques for pruning the search space when computing optimized association rules for both categorical and numeric attributes. Finally, we report the results of our experiments that indicate that our pruning algorithms are efficient for a large number of uninstantiated attributes, disjunctions, and values in the domain of the attributes
Keywords :
computational complexity; data mining; tree searching; very large databases; attributes; categorical attributes; knowledge discovery; numeric attributes; optimized association rules; optimized association rules mining; uninstantiated attributes; Association rules; Data mining;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.979971
Filename :
979971
Link To Document :
بازگشت