Title :
Mining Tree Patterns Using Frequent 2-Subtree Checking
Author :
Deng, Dongjie ; Ma, Zhixin ; Xu, Yusheng ; Liu, Li
Author_Institution :
Sch. of Inf. Sci. & Eng., Lanzhou Univ., Lanzhou, China
fDate :
Nov. 30 2009-Dec. 1 2009
Abstract :
In this paper, we systematically explore the problem of frequent subtree mining and present a novel pruning strategy, F2SC (frequent 2-subtree checking), which can be used in all Apriori-like subtree mining algorithms. With a little more memory overhead of keeping frequent 2-subtrees list, F2SC can prunes the invalid candidate which contains infrequent 2-subtree by checking the frequent 2-subtrees list and decreases the total cost of frequency counting effectively. At the same time, we optimize TREEMINER and present the improved algorithm TMp, which uses F2SC to prune invalid candidate subtree patterns. A set of comprehensive performance experiments demonstrates the efficiency of proposed pruning strategy by compare TMp with TREEMINER.
Keywords :
data mining; program verification; trees (mathematics); apriori-like subtree mining algorithms; frequent subtree mining; subtree checking; tree pattern mining; Costs; Data mining; Frequency; Information science; Iterative algorithms; Knowledge acquisition; Knowledge engineering; Labeling; Relational databases; Tree graphs; Frequent 2-Subtree Checking; pruning strategy; subtree mining;
Conference_Titel :
Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3888-4
DOI :
10.1109/KAM.2009.171