DocumentCode
2797540
Title
Mining Tree Patterns Using Frequent 2-Subtree Checking
Author
Deng, Dongjie ; Ma, Zhixin ; Xu, Yusheng ; Liu, Li
Author_Institution
Sch. of Inf. Sci. & Eng., Lanzhou Univ., Lanzhou, China
Volume
2
fYear
2009
fDate
Nov. 30 2009-Dec. 1 2009
Firstpage
162
Lastpage
165
Abstract
In this paper, we systematically explore the problem of frequent subtree mining and present a novel pruning strategy, F2SC (frequent 2-subtree checking), which can be used in all Apriori-like subtree mining algorithms. At the cost of a small memory overhead for keeping a list of frequent 2-subtrees, F2SC prunes invalid candidates that contain an infrequent 2-subtree by checking them against this list, which effectively decreases the total cost of frequency counting. We also optimize TREEMINER and present the improved algorithm TMp, which uses F2SC to prune invalid candidate subtree patterns. A set of comprehensive performance experiments comparing TMp with TREEMINER demonstrates the efficiency of the proposed pruning strategy.
Keywords
data mining; program verification; trees (mathematics); apriori-like subtree mining algorithms; frequent subtree mining; subtree checking; tree pattern mining; Costs; Data mining; Frequency; Information science; Iterative algorithms; Knowledge acquisition; Knowledge engineering; Labeling; Relational databases; Tree graphs; Frequent 2-Subtree Checking; pruning strategy; subtree mining;
fLanguage
English
Publisher
IEEE
Conference_Titel
Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3888-4
Type
conf
DOI
10.1109/KAM.2009.171
Filename
5362193