Title :
Frequent pattern discovery based on co-occurrence frequent item tree
Author :
Hemalatha, R. ; Krishnan, A. ; Senthamarai, C. ; Hemamilini, R.
Author_Institution :
K.S.R. Coll. of Tech., Tamil Nadu, India
Abstract :
Existing association rule mining algorithms like a priori, partition algorithm, FP-growth suffers from many problems when mining massive transactional datasets. Some of the major problems are: (i) the repetitive I/O disk scans, (2) the huge computation involved during the candidacy generation, and (3) the high memory dependency. This paper presents the implementation of frequent itemset mining algorithm, COFI, which achieves its efficiency by applying four new ideas. First, it can mine using compact memory based data structures. Second, for each frequent item assigned, a relatively small independent tree is built summarizing co-occurrences. Third, clever pruning reduces the search space drastically. Finally, a simple and non-recursive mining process reduces the memory requirements as minimum candidacy generation and counting is needed to generate all relevant frequent patterns.
Keywords :
data mining; pattern classification; tree data structures; FP-tree growth; a priori algorithm; association rule mining algorithm; cooccurrence frequent item tree; frequent itemset mining algorithm; frequent pattern discovery; memory data structures; minimum candidacy generation; nonrecursive mining process; partition algorithm; repetitive I/O disk scans; transactional datasets; Association rules; Buildings; Computational efficiency; Data mining; Educational institutions; Itemsets; Partitioning algorithms; Tree data structures;
Conference_Titel :
Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
Print_ISBN :
0-7803-8840-2
DOI :
10.1109/ICISIP.2005.1529474