DocumentCode :
3525704
Title :
Efficient extraction of maximally common subtrees from XML documents for web services
Author :
Paik, Juryon ; Son, Y.J. ; Fouthoi, Farshad ; Kim, Ungmo
Author_Institution :
Dept. of Comput. Eng., Sungkyunkwan Univ., Gyeonggi
Volume :
2
fYear :
0
fDate :
0-0 0
Firstpage :
1371
Lastpage :
1375
Abstract :
Web services need to integrate and classify XML documents received from multiple and heterogeneous sources. To this end, it requires a mechanism for extracting common structures from a large XML dataset, called frequent subtrees. In this paper we propose an efficient and scalable algorithm, EMaxS, for mining frequent subtrees of Web XML documents stored in Web servers. Compared with previous works, the proposed algorithm uses only simple bitwise operations and does not require any join steps, which are typically expensive
Keywords :
Internet; XML; data mining; tree data structures; Web service; XML document handling; data mining; data structure; frequent pattern discovery; frequent subtrees; maximally common subtree; Computer science; Data mining; Data warehouses; Iterative methods; Simple object access protocol; Surges; Tree data structures; Web server; Web services; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Communication Technology, 2005, ICACT 2005. The 7th International Conference on
Conference_Location :
Phoenix Park
Type :
conf
DOI :
10.1109/ICACT.2005.246224
Filename :
1463049
Link To Document :
بازگشت