• DocumentCode
    3165295
  • Title

    Mining Frequent Itemsets in a Stream

  • Author

    Calders, Toon ; Dexters, Nele ; Goethals, Bart

  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    83
  • Lastpage
    92
  • Abstract
    We study the problem of finding frequent itemsets in a continuous stream of transactions. The current frequency of an itemset in a stream is defined as its maximal frequency over all possible windows in the stream from any point in the past until the current state that satisfy a minimal length constraint. Properties of this new measure are studied and an incremental algorithm that allows, at any time, to immediately produce the current frequencies of all frequent itemsets is proposed. Experimental and theoretical analysis show that the space requirements for the algorithm are extremely small for many realistic data distributions.
  • Keywords
    data mining; continuous stream; data distributions; frequent itemsets mining; incremental algorithm; minimal length constraint; Algorithm design and analysis; Current measurement; Data mining; Databases; Frequency measurement; History; Ice; Itemsets; Marketing and sales; Time measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3018-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2007.66
  • Filename
    4470232