DocumentCode
3268317
Title
PRIX: indexing and querying XML using prufer sequences
Author
Rao, Praveen ; Moon, Bongki
Author_Institution
Dept. of Comput. Sci., Arizona Univ., Tucson, AZ, USA
fYear
2004
fDate
30 March-2 April 2004
Firstpage
288
Lastpage
299
Abstract
We propose a new way of indexing XML documents and processing twig patterns in an XML database. Every XML document in the database can be transformed into a sequence of labels by Prufer´s method that constructs a one-to-one correspondence between trees and sequences. During query processing, a twig pattern is also transformed into its Prufer sequence. By performing subsequence matching on the set of sequences in the database, and performing a series of refinement phases that we have developed, we can find all the occurrences of a twig pattern in the database. Our approach allows holistic processing of a twig pattern without breaking the twig into root-to-leaf paths and processing these paths individually. Furthermore, we show that all correct answers are found without any false dismissals or false alarms. Experimental results demonstrate the performance benefits of our proposed techniques.
Keywords
XML; database indexing; pattern matching; query processing; sequences; tree data structures; trees (mathematics); B-tree; Prufer sequence; XML database; XML document indexing; false alarm; holistic processing; query processing; refinement phase; root-to-leaf path; twig pattern processing; Computer science; Databases; Indexing; Information representation; Internet; Merging; Moon; Pattern matching; Query processing; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN
1063-6382
Print_ISBN
0-7695-2065-0
Type
conf
DOI
10.1109/ICDE.2004.1320005
Filename
1320005
Link To Document