DocumentCode :
3122240
Title :
Sketch-Based Summarization of Ordered XML Streams
Author :
Mayorga, Veronica ; Polyzotis, Neoklis
Author_Institution :
Univ. of California at Santa Cruz, Santa Cruz, CA
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
541
Lastpage :
552
Abstract :
In this paper, we tackle the problem of approximately answering a continuous aggregate query over an XML stream using limited memory. This problem is key in the development of tools for the on-line monitoring and analysis of streaming XML data, such as complex event streams, RSS feeds, or workflow traces. We introduce a novel technique that supports XML queries with any combination of the common XPath axes, namely, ancestor, descendant, parent, child, following, preceding, following-sibling, and preceding-sibling. At the heart of our approach lies an efficient transform that reduces a continuous XML query to an equi-join query over relational streams. We detail the transform and discuss its integration with randomized sketches as a basic mechanism to estimate the result of the XML query. We further enhance this mechanism with structural sieving, a technique that takes advantage of the XML data and query characteristics in order to improve the accuracy of the sketch-based approximation. We present an extensive experimental study on real-life and synthetic data sets that validates the effectiveness of our approach and demonstrates its advantages over existing techniques.
Keywords :
XML; approximation theory; query processing; XML stream; XPath axes; randomized sketches; sketch-based approximation; sketch-based summarization; Aggregates; Data engineering; Data models; Fasteners; Feeds; Heart; Monitoring; Query processing; Testing; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.107
Filename :
4812433
Link To Document :
بازگشت