DocumentCode
507342
Title
A New Algorithm for Mining Global Frequent Itemsets in a Stream
Author
Guo, Lichao ; Su, Hongye ; Qu, Yu
Author_Institution
State Key Lab. of Ind. Control Technol., Zhejiang Univ., Hangzhou, China
Volume
5
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
232
Lastpage
238
Abstract
To find global frequent itemsets in a multiple, continuous, rapid and time-varying data stream, a fast, incremental, real-time, and little-memory-cost algorithm should be used. Based on the max-frequency window model, a BHS summary structure and a novel algorithm called GGFI-MFW are proposed. It merely updates the summaries for subsets of the data new arrived and could directly generate the max-frequency for a given item set without scanning the whole summary. Experiment results indicate that the proposed algorithm could efficiently find global frequent itemsets over a data stream with a small memory and perform overwhelming superiority for a large number of distinct items.
Keywords
data mining; BHS summary structure; GGFI-MFW; little-memory-cost algorithm; max-frequency window model; mining global frequent itemsets; time-varying data stream; Data mining; Frequency measurement; Frequency shift keying; Fuzzy systems; Ice; Industrial control; Itemsets; Marketing and sales; Research and development;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3735-1
Type
conf
DOI
10.1109/FSKD.2009.265
Filename
5360625
Link To Document