• DocumentCode
    2025990
  • Title

    Efficient single-pass frequent itemsets mining over data streams

  • Author

    Tan, Jun ; Bu, Yingyong ; Zhao, Haiming

  • Author_Institution
    Coll. of Comput. & Inf. Eng., Central South Univ. of Forestry & Technol., Changsha, China
  • Volume
    3
  • fYear
    2010
  • fDate
    10-12 Aug. 2010
  • Firstpage
    1438
  • Lastpage
    1441
  • Abstract
    Finding frequent itemsets is one of the most important issues in mining data streams for many applications such as web click stream mining, sensor networks, and network traffic analysis. Most prominent algorithms for traditional transaction databases need multiple scans, therefore, they are not suitable for data streams which are continuous, unbounded, usually come with high speed. In this paper, we propose a new single -pass algorithms which use the FP-tree data structure in combination with the IT-matrix technique which greatly reduces the need to traverse FP-trees. The experiment results on synthetic datasets and real datasets show that our proposed algorithm is an efficient method for mining frequent itemsets over data streams.
  • Keywords
    data mining; tree data structures; FP-tree data structure; IT-matrix technique; Web click stream mining; data stream mining; efficient single-pass frequent itemset mining; network traffic analysis; sensor networks; single -pass algorithms; synthetic datasets; transaction databases; traverse FP-trees; Algorithm design and analysis; Construction industry; Data mining; Data structures; Itemsets; Radiation detectors; Data streams; FP-growth; Frequent itemsets; IT-matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
  • Conference_Location
    Yantai, Shandong
  • Print_ISBN
    978-1-4244-5931-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2010.5569199
  • Filename
    5569199