• DocumentCode
    3289927
  • Title

    Mining Recent Frequent Itemsets in Data Streams

  • Author

    Li, Kun ; Wang, Yong-yan ; Ellahi, Manzoor ; Wang, Hong-an

  • Author_Institution
    Intell. Eng. Lab., Inst. of Software Chinese Acad. of Sci., Beijing
  • Volume
    4
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    353
  • Lastpage
    358
  • Abstract
    Mining frequent itemsets in data streams is a hot research topic in recent years. Due to the continuous, high-speed and unbounded properties of data streams, traditional algorithms on static dataset are not suitable for mining in data streams. In this paper we present bounded frequent itemsets stream (abbreviated as BFI-stream) algorithm, which uses a prefix-tree based structure, called BFI-tree, to maintain all accurate frequent itemsets from sliding windows over data streams. By monitoring the boundary between frequent itemsets and infrequent itemsets, it restricts the update process on a small part of the tree. Mining all frequent itemsets with accurate frequencies is just to traverse the tree. It is time efficient even when the user-specified minimum support threshold is small. Experiments compare the time and space usage with MFI-TransSW, which also returns all accurate frequent itemsets from sliding windows. The results show that BFI-stream outperforms MFI-TransSW in both time and space at most time especially when the minimum support is small.
  • Keywords
    data mining; tree data structures; BFI-stream; MFI-TransSW; bounded frequent itemsets stream; data streams; prefix-tree based structure; Data engineering; Data mining; Delay; Financial management; Frequency; Fuzzy systems; Itemsets; Knowledge engineering; Monitoring; Telecommunication network management; data mining; data stream; frequent itemset;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
  • Conference_Location
    Jinan Shandong
  • Print_ISBN
    978-0-7695-3305-6
  • Type

    conf

  • DOI
    10.1109/FSKD.2008.255
  • Filename
    4666411