DocumentCode
1781416
Title
Constrained Frequent Subtree Mining Method
Author
Kun Han ; Weifeng Lv ; Baocai Yin ; Yongli Hu
fYear
2014
fDate
28-30 Nov. 2014
Firstpage
287
Lastpage
292
Abstract
With the semi-structured data rapidly growing, it is crucial to obtain valuable information for different applications. So many data mining methods are proposed and the frequent sub trees mining is an important and typical method. The current mining methods demand substantial computational time and space, and return a huge number of patterns, but some important sub trees are often missed and some patterns are uninteresting to users. In this paper we proposed two novel algorithms, namely FSMDC and FSMIC, for mining frequent embedded sub trees from rooted labeled ordered trees database. In these proposed algorithms, the distance and interest constraint are introduced respectively to achieve expected mining results. The experiments show that the two newly developed algorithms are efficient, scalable and more consistent with purpose of users.
Keywords
data mining; database management systems; trees (mathematics); FSMDC; FSMIC; constrained frequent subtree mining method; data mining methods; distance constraint; interest constraint; rooted labeled ordered trees database; Algorithm design and analysis; Classification algorithms; Data mining; Databases; Educational institutions; Mathematical model; Vegetation; distance constraint; frequent subtree mining; interest constraint;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Home (ICDH), 2014 5th International Conference on
Conference_Location
Guangzhou
Print_ISBN
978-1-4799-4285-5
Type
conf
DOI
10.1109/ICDH.2014.62
Filename
6996777
Link To Document