• DocumentCode
    2797540
  • Title

    Mining Tree Patterns Using Frequent 2-Subtree Checking

  • Author

    Deng, Dongjie ; Ma, Zhixin ; Xu, Yusheng ; Liu, Li

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Lanzhou Univ., Lanzhou, China
  • Volume
    2
  • fYear
    2009
  • fDate
    Nov. 30 2009-Dec. 1 2009
  • Firstpage
    162
  • Lastpage
    165
  • Abstract
    In this paper, we systematically explore the problem of frequent subtree mining and present a novel pruning strategy, F2SC (frequent 2-subtree checking), which can be used in all Apriori-like subtree mining algorithms. With a little more memory overhead of keeping frequent 2-subtrees list, F2SC can prunes the invalid candidate which contains infrequent 2-subtree by checking the frequent 2-subtrees list and decreases the total cost of frequency counting effectively. At the same time, we optimize TREEMINER and present the improved algorithm TMp, which uses F2SC to prune invalid candidate subtree patterns. A set of comprehensive performance experiments demonstrates the efficiency of proposed pruning strategy by compare TMp with TREEMINER.
  • Keywords
    data mining; program verification; trees (mathematics); apriori-like subtree mining algorithms; frequent subtree mining; subtree checking; tree pattern mining; Costs; Data mining; Frequency; Information science; Iterative algorithms; Knowledge acquisition; Knowledge engineering; Labeling; Relational databases; Tree graphs; Frequent 2-Subtree Checking; pruning strategy; subtree mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3888-4
  • Type

    conf

  • DOI
    10.1109/KAM.2009.171
  • Filename
    5362193