DocumentCode
3363429
Title
A fast and versatile path index for querying semi-structured data
Author
Barg, Michael ; Wong, Raymond K.
Author_Institution
Sch. of Comput. Sci. & Eng., New South Wales Univ., Sydney, NSW, Australia
fYear
2003
fDate
26-28 March 2003
Firstpage
249
Lastpage
256
Abstract
The richness of semi-structured data allows data of varied and inconsistent structures to be stored in a single database. Such data can be represented as a graph, and queries can be constructed using path expressions, which describe traversals through the graph. Instead of providing optimal performance for a limited range of path expressions, we propose a mechanism which is shown to have consistent and high performance for path expressions of any complexity, including those with descendant operators (path wildcards). We further detail mechanisms which employ our index to perform more complex processing, such as evaluating both path expressions containing links and entire (sub) queries containing path based predicates. Performance is shown to be independent of the number of terms in the path expression, even where these contain wildcards. Experiments show that our index is faster than conventional methods by up to two orders of magnitude for certain query types, is small, and scales well.
Keywords
graph theory; hypermedia markup languages; query processing; tree data structures; descendant operators; path based predicates; path expressions; path index; query types; querying; semi-structured data queries; wildcards; Computer science; Data engineering; Databases; Degradation; Encoding; Indexes; Indexing; Motion pictures; Performance evaluation; Query processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings. Eighth International Conference on
Conference_Location
Kyoto, Japan
Print_ISBN
0-7695-1895-8
Type
conf
DOI
10.1109/DASFAA.2003.1192389
Filename
1192389
Link To Document