Title :
A Mining Maximal Frequent Itemsets over the Entire History of Data Streams
Author :
Mao, Yinmin ; Li, Hong ; Yang, Lumin ; Chen, Zhigang ; Liu, Lixin
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
Abstract :
Mining maximal frequent itemsets has been widely concerned. However, mining data streams is more difficult than mining static databases because of the huge, high-speed and continuous characteristics of streaming data. This paper presents an algorithm, called IDSM-MFI. The algorithm uses a synopsis data structure to store the items of transactions embedded data streams so far. It adopts a top-bottom and bottom-top method to mine the set of all maximal frequent itemsets in landmark windows over data stream, which can be output in real time based on users´ specified thresholds. Theoretical analysis and experimental results show that our algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of data stream.
Keywords :
data mining; data structures; IDSM-MFI algorithm; bottom-top method; data streams mining; landmark windows; maximal frequent itemset mining; synopsis data structure; top-bottom method; Algorithm design and analysis; Data engineering; Data mining; Data structures; Frequency; History; Information science; Itemsets; Sensor phenomena and characterization; Transaction databases; data mining; data streams; maximal frequent itemsets;
Conference_Titel :
Database Technology and Applications, 2009 First International Workshop on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3604-0
DOI :
10.1109/DBTA.2009.125