DocumentCode :
2797540
Title :
Mining Tree Patterns Using Frequent 2-Subtree Checking
Author :
Deng, Dongjie ; Ma, Zhixin ; Xu, Yusheng ; Liu, Li
Author_Institution :
Sch. of Inf. Sci. & Eng., Lanzhou Univ., Lanzhou, China
Volume :
2
fYear :
2009
fDate :
Nov. 30 2009-Dec. 1 2009
Firstpage :
162
Lastpage :
165
Abstract :
In this paper, we systematically explore the problem of frequent subtree mining and present a novel pruning strategy, F2SC (frequent 2-subtree checking), which can be used in all Apriori-like subtree mining algorithms. With a little more memory overhead of keeping frequent 2-subtrees list, F2SC can prunes the invalid candidate which contains infrequent 2-subtree by checking the frequent 2-subtrees list and decreases the total cost of frequency counting effectively. At the same time, we optimize TREEMINER and present the improved algorithm TMp, which uses F2SC to prune invalid candidate subtree patterns. A set of comprehensive performance experiments demonstrates the efficiency of proposed pruning strategy by compare TMp with TREEMINER.
Keywords :
data mining; program verification; trees (mathematics); apriori-like subtree mining algorithms; frequent subtree mining; subtree checking; tree pattern mining; Costs; Data mining; Frequency; Information science; Iterative algorithms; Knowledge acquisition; Knowledge engineering; Labeling; Relational databases; Tree graphs; Frequent 2-Subtree Checking; pruning strategy; subtree mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3888-4
Type :
conf
DOI :
10.1109/KAM.2009.171
Filename :
5362193
Link To Document :
بازگشت