• DocumentCode
    2142871
  • Title

    A Probabilistic Approach to Apriori Algorithm

  • Author

    Sharma, Vaibhav ; Beg, M. M Sufyan

  • Author_Institution
    Comput. Sci. Dept., Inst. of Technol. & Manage., Gurgaon, India
  • fYear
    2010
  • fDate
    14-16 Aug. 2010
  • Firstpage
    402
  • Lastpage
    408
  • Abstract
    We consider the problem of applying probability concepts to discover frequent itemsets in a transaction database. The paper presents a probabilistic algorithm to discover association rules. The proposed algorithm outperforms the a priori algorithm for larger databases without losing a single rule. It involves a single database scan and significantly reduces the number of unsuccessful candidate sets generated in apriori algorithm that later fails the minimum support test. It uses the concept of recursive medians to compute the dispersion in the transaction list for each itemset. The recursive medians are implemented in the algorithm as an Inverted V-Median Search Tree (IVMST). The recursive medians are used to compute the maximum number of common transactions for any two itemsets. We try to present a time efficient probabilistic mechanism to discover frequent itemsets.
  • Keywords
    data mining; distributed databases; probability; IVMST; apriori algorithm; frequent itemsets discovery; inverted V-median search tree; probabilistic approach; recursive medians concept; single database scan; transaction database; Algorithm design and analysis; Classification algorithms; Data mining; Itemsets; Probabilistic logic; Probability density function; Data mining; KDD; apriori algorithm; association rules; frequent itemsets; probability; statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2010 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4244-7964-1
  • Type

    conf

  • DOI
    10.1109/GrC.2010.69
  • Filename
    5575947