DocumentCode :
2548145
Title :
Exploiting Path Information for Syntax-Based XML Subtree Matching in RDBs
Author :
Liang, Wenxin ; Yokota, Haruo
Author_Institution :
Tokyo Inst. of Technol., Japan Sci. & Technol. Agency, Tokyo
fYear :
2008
fDate :
20-22 July 2008
Firstpage :
105
Lastpage :
112
Abstract :
In this paper, we propose two methods that exploit path information, a direct-parent based method and a full-path based method, for syntax-based XML subtree matching in RDBs. For each proposed method, we discuss two ways of using the path information: one applies the path information after the leaf nodes have been matched; the other uses the path information together with the PCDATA value of the leaf node as the join object. We perform experiments on real bibliography XML documents stored in RDBs to evaluate the execution time, precision, and recall of subtree matching. The experimental results indicate that both proposed path-based methods effectively improve the precision and recall of subtree matching compared with the original SLAX algorithm.
Keywords :
XML; relational databases; tree data structures; PCDATA value; SLAX algorithm; XML document; direct-parent based method; full-path based method; path information; relational database; syntax-based XML subtree matching; Bibliographies; Encyclopedias; Information management; Internet; Labeling; Large scale integration; Large-scale systems; Performance evaluation; Wikipedia; XML; XML data integration; XML path; syntax-based subtree matching;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Web-Age Information Management, 2008. WAIM '08. The Ninth International Conference on
Conference_Location :
Zhangjiajie, Hunan
Print_ISBN :
978-0-7695-3185-4
Electronic_ISBN :
978-0-7695-3185-4
Type :
conf
DOI :
10.1109/WAIM.2008.28
Filename :
4597002