DocumentCode
2142921
Title
Efficient Algorithm for Discovering Potential Interesting Patterns with Closed Itemsets
Author
Singh, Raj ; Johnsten, Tom ; Raghavan, Vijay ; Xie, Ying
Author_Institution
Sch. of Sci. & Comp. Eng., Univ. of Houston Clear Lake, Houston, TX, USA
fYear
2010
fDate
14-16 Aug. 2010
Firstpage
414
Lastpage
419
Abstract
A pattern discovered from a collection of data is usually considered potentially interesting if its information content can assist the user in their decision making process. To that end, we have defined the potential interestingness of a pattern based on whether it provides statistical knowledge that is able to affect one´s belief system. In previous work, we proposed two novel algorithms, Discovering All Potentially Interesting Patterns (DAPIP) and All-Confidence Discovery of Potentially Interesting Patterns (ACDPIP), designed to discover potentially interesting patterns from a collection of data. Results of experimental investigations show that the application of these two algorithms is limited to non-dense datasets. In response, we propose a new algorithm, referred to as ACDPIP-Closed, designed to discover potential interesting patterns from dense datasets. We show empirically that ACDPIP-Closed is able to effectively and efficiently discover potentially interesting patterns from dense datasets. Additional contributions provided by the paper include a definition of a frequent closed itemset based on an all-confidence threshold and a theorem stating that, under the assumption of a particular ordering of items, an itemset is support based closed if and only if it is all-confidence based closed.
Keywords
data mining; decision making; pattern classification; ACDPIP; DAPIP; all-confidence discovery of potentially interesting patterns; belief system; closed itemsets; data collection; decision making process; discovering all potentially interesting patterns; frequent closed itemset; information content; nondense datasets; potential interesting patterns; statistical knowledge; Algorithm design and analysis; Association rules; Context; Educational institutions; Itemsets; Noise; Redundancy; Closed itemsets; Data Mining; Interesting Patterns; Positive and Negative rules; assoiciation rules;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
978-1-4244-7964-1
Type
conf
DOI
10.1109/GrC.2010.55
Filename
5575950
Link To Document