DocumentCode :
2368999
Title :
Mining high utility itemsets
Author :
Chan, Raymond Chan ; Yang, Qiang ; Shen, Yi-Dang
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., China
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
19
Lastpage :
26
Abstract :
Traditional association rule mining algorithms only generate a large number of highly frequent rules, but these rules do not provide useful answers for what the high utility rules are. We develop a novel idea of top-K objective-directed data mining, which focuses on mining the top-K high utility closed patterns that directly support a given business objective. To association mining, we add the concept of utility to capture highly desirable statistical patterns and present a level-wise item-set mining algorithm. With both positive and negative utilities, the antimonotone pruning strategy in Apriori algorithm no longer holds. In response, we develop a new pruning strategy based on utilities that allow pruning of low utility itemsets to be done by means of a weaker but antimonotonic condition. Our experimental results show that our algorithm does not require a user specified minimum utility and hence is effective in practice.
Keywords :
data mining; probability; very large databases; Apriori algorithm; antimonotone pruning strategy; association rule mining; high utility itemset; statistical pattern; top-K objective-directed data mining; Data mining; Itemsets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250893
Filename :
1250893
Link To Document :
بازگشت