• DocumentCode
    1973619
  • Title

    Sampling Based N-Hash Algorithm for Searching Frequent Itemset

  • Author

    Chen Yong-ming ; Zhu Mei-ling

  • Author_Institution
    Coll. of Math., Chengdu Univ. of Inf. Technol., Chengdu, China
  • fYear
    2010
  • fDate
    20-22 Aug. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Searching frequent itemsets is the critical problem in generating association rules in data mining, classic Hash-based technique, put forward by J. S. Park, for searching frequent itemsets has two shortcomings: one is that it is difficult to choose an appropriate hash function; the other is that it is liable to cause hash colliding. In order to solve the two problems, Chen Y.M. proposed N-Hash algorithm which needn´t to choose hash function and avoided hash colliding. In this paper, the sampling technique is employed to improve the efficiency of N-Hash algorithm.
  • Keywords
    data mining; file organisation; sampling methods; association rules; data mining; frequent itemsets searching; hash colliding; hash function; sampling based N-hash algorithm; sampling technique; Algorithm design and analysis; Association rules; Itemsets; Sampling methods; Strontium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Internet Technology and Applications, 2010 International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5142-5
  • Electronic_ISBN
    978-1-4244-5143-2
  • Type

    conf

  • DOI
    10.1109/ITAPP.2010.5566076
  • Filename
    5566076