Title :
DSMFI-Miner: An Algorithm for Mining Maximal Frequent Itemsets on Data Streams
Author :
Jiadong, Ren ; Huiling, He ; Lina, Xu ; Hu Changzhen
Author_Institution :
Coll. of Inf. Sci. & Eng., YanShan Univ., Qinhuangdao, China
Abstract :
In order to improve the mining efficiency of frequent itemsets on data streams, we present an algorithm DSMFI_Miner for mining maximal frequent itemsets on data streams. First, a data structure DSMFI_tree is constructed to store the potential frequent itemsets and the data stream is divided into a set of segments, then the potential maximal frequent itemsets on each segment are obtained by a breadth-first algorithm, while the generated itemsets and their subsets are stored in the local DSMFI_tree which is updated dynamically, finally, the maximal frequent itemsets on the data stream can be rapidly found by a bottom-up search strategy from DSMFI_tree. The experimental result shows that the execution efficiency of DSMFI_Miner is better than that of INSTANT algorithm.
Keywords :
data mining; data structures; tree searching; trees (mathematics); DSMFI-miner; DSMFI_tree; INSTANT algorithm; bottom-up search strategy; breadth-first algorithm; data streams; data structure; mining efficiency; mining maximal frequent itemsets; potential maximal frequent itemsets; Computer science; Data engineering; Data mining; Data structures; Databases; Educational institutions; Helium; Information science; Itemsets; Testing; data streams; frequent itemsets; maximal frequent itemsets;
Conference_Titel :
Computer Science and Engineering, 2009. WCSE '09. Second International Workshop on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-3881-5
DOI :
10.1109/WCSE.2009.783