DocumentCode
3525704
Title
Efficient extraction of maximally common subtrees from XML documents for web services
Author
Paik, Juryon ; Son, Y.J. ; Fouthoi, Farshad ; Kim, Ungmo
Author_Institution
Dept. of Comput. Eng., Sungkyunkwan Univ., Gyeonggi
Volume
2
fYear
0
fDate
0-0 0
Firstpage
1371
Lastpage
1375
Abstract
Web services need to integrate and classify XML documents received from multiple and heterogeneous sources. To this end, it requires a mechanism for extracting common structures from a large XML dataset, called frequent subtrees. In this paper we propose an efficient and scalable algorithm, EMaxS, for mining frequent subtrees of Web XML documents stored in Web servers. Compared with previous works, the proposed algorithm uses only simple bitwise operations and does not require any join steps, which are typically expensive
Keywords
Internet; XML; data mining; tree data structures; Web service; XML document handling; data mining; data structure; frequent pattern discovery; frequent subtrees; maximally common subtree; Computer science; Data mining; Data warehouses; Iterative methods; Simple object access protocol; Surges; Tree data structures; Web server; Web services; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Communication Technology, 2005, ICACT 2005. The 7th International Conference on
Conference_Location
Phoenix Park
Type
conf
DOI
10.1109/ICACT.2005.246224
Filename
1463049
Link To Document