DocumentCode :
3426417
Title :
An improved multiple minimum support based approach to mine rare association rules
Author :
Kiran, R. Uday ; Krishna Re, P.
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
340
Lastpage :
347
Abstract :
In this paper we have proposed an improved approach to extract rare association rules. Rare association rules are the association rules containing rare items. Rare items are less frequent items. For extracting rare itemsets, the single minimum support (minsup) based approaches like Apriori approach suffer from ldquorare item problemrdquo dilemma. At high minsup value, rare itemsets are missed, and at low minsup value, the number of frequent itemsets explodes. To extract rare itemsets, an effort has been made in the literature in which minsup of each item is fixed equal to the percentage of its support. Even though this approach improves the performance over single minsup based approaches, it still suffers from ldquorare item problemrdquo dilemma. If minsup for the item is fixed by setting the percentage value high, the rare itemsets are missed as the minsup for the rare items becomes close to their support, and if minsup for the item is fixed by setting the percentage value low, the number of frequent itemsets explodes. In this paper, we propose an improved approach in which minsup is fixed for each item based on the notion of ldquosupport differencerdquo. The proposed approach assigns appropriate minsup values for frequent as well as rare items based on their item supports and reduces both ldquorule missingrdquo and ldquorule explosionrdquo problems. Experimental results on both synthetic and real world datasets show that the proposed approach improves performance over existing approaches by minimizing the explosion of number of frequent itemsets involving frequent items and without missing the frequent itemsets involving rare items.
Keywords :
data mining; mine rare association rules; minsup; multiple minimum support; rule explosion; rule missing; Association rules; Data mining; Explosions; Frequency; Itemsets; Terminology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2765-9
Type :
conf
DOI :
10.1109/CIDM.2009.4938669
Filename :
4938669
Link To Document :
بازگشت