DocumentCode
3589301
Title
QTPI:A quick terse path index for XML keyword search
Author
Li, Xia ; Li, Zhanhuai ; Peng Wang ; Chen, Qun
Author_Institution
Sch. of Comput. Sci. & Technol., Northwestern Polytech. Univ., Xi´´an, China
Volume
6
fYear
2010
Abstract
The emergence of the Web has increased interests in XML data. XML query languages such as XQuery, XPath and NEXI, they use label paths to traverse the irregularly structured data. Without efficient indexes, query processing can be quite inefficient due to an exhaustive traversal on XML data. To overcome the inefficiency, we propose a novel index method, quick terse path index (named QTPI), which contain the content and structure of the XML documents. Unlike index methods that disassemble a query into multiple sub-queries, and then join the results of these sub-queries to provide the final answers, QTPI uses tree structures as the basic unit of query to avoid expensive join operations. Furthermore, QTPI provides a terse index that can quickly derive the keyword query and generate a set of effective structured queries by analyzing the given keyword query and scanning the index, hence it has a performance advantage over methods indexing either. We have conducted an experimental study on real-life XML data sets and the experimental results show that QTPI is effective, and efficient in supporting structural queries when compared with existing proposals.
Keywords
Internet; XML; document handling; indexing; query languages; query processing; tree data structures; NEXI; QTPI; Web emergence; XML data; XML document; XML keyword search; XML query language; XPath; XQuery; index method; index scanning; novel index method; query processing; quick terse path index; real life XML data set; structured data; tree structure; Computer science; Database languages; Electronic mail; Indexes; Indexing; Information retrieval; Keyword search; Query processing; Web sites; XML; Keyword Search; Path Index; Structured Query; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Print_ISBN
978-1-4244-6347-3
Type
conf
DOI
10.1109/ICCET.2010.5486251
Filename
5486251
Link To Document