• DocumentCode
    2142921
  • Title

    Efficient Algorithm for Discovering Potential Interesting Patterns with Closed Itemsets

  • Author

    Singh, Raj ; Johnsten, Tom ; Raghavan, Vijay ; Xie, Ying

  • Author_Institution
    Sch. of Sci. & Comp. Eng., Univ. of Houston Clear Lake, Houston, TX, USA
  • fYear
    2010
  • fDate
    14-16 Aug. 2010
  • Firstpage
    414
  • Lastpage
    419
  • Abstract
    A pattern discovered from a collection of data is usually considered potentially interesting if its information content can assist the user in their decision making process. To that end, we have defined the potential interestingness of a pattern based on whether it provides statistical knowledge that is able to affect one´s belief system. In previous work, we proposed two novel algorithms, Discovering All Potentially Interesting Patterns (DAPIP) and All-Confidence Discovery of Potentially Interesting Patterns (ACDPIP), designed to discover potentially interesting patterns from a collection of data. Results of experimental investigations show that the application of these two algorithms is limited to non-dense datasets. In response, we propose a new algorithm, referred to as ACDPIP-Closed, designed to discover potential interesting patterns from dense datasets. We show empirically that ACDPIP-Closed is able to effectively and efficiently discover potentially interesting patterns from dense datasets. Additional contributions provided by the paper include a definition of a frequent closed itemset based on an all-confidence threshold and a theorem stating that, under the assumption of a particular ordering of items, an itemset is support based closed if and only if it is all-confidence based closed.
  • Keywords
    data mining; decision making; pattern classification; ACDPIP; DAPIP; all-confidence discovery of potentially interesting patterns; belief system; closed itemsets; data collection; decision making process; discovering all potentially interesting patterns; frequent closed itemset; information content; nondense datasets; potential interesting patterns; statistical knowledge; Algorithm design and analysis; Association rules; Context; Educational institutions; Itemsets; Noise; Redundancy; Closed itemsets; Data Mining; Interesting Patterns; Positive and Negative rules; assoiciation rules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2010 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4244-7964-1
  • Type

    conf

  • DOI
    10.1109/GrC.2010.55
  • Filename
    5575950