Title :
XQuery streaming by Forest Transducers
Author :
Hakuta, S. ; Maneth, Sebastian ; Nakano, Kaoru ; Iwasaki, Hisao
Author_Institution :
DOCOMO Datacom, Inc., Japan
fDate :
March 31 2014-April 4 2014
Abstract :
Streaming of XML transformations is a challenging task and only a few existing systems support streaming. Research approaches generally define custom fragments of XQuery and XPath that are amenable to streaming, and then design custom algorithms for each fragment. These languages have several shortcomings. Here we take a more principled approach to the problem of streaming XQuery-based transformations. We start with an elegant transducer model for which many static analysis problems are well-understood: the Macro Forest Transducer (MFT). We show that a large fragment of XQuery can be translated into MFTs - indeed, a fragment of XQuery, that can express important features that are missing from other XQuery stream engines, such as GCX: our fragment of XQuery supports XPath predicates and let-statements. We then use an existing streaming engine for MFTs and apply a well-founded set of optimizations from functional programming such as strictness analysis and deforestation. Our prototype achieves time and memory efficiency comparable to the fastest known engine for XQuery streaming, GCX. This is surprising because our engine relies on the OCaml built in garbage collector and does not use any specialized buffer management, while GCX´s efficiency is due to clever and explicit buffer management.
Keywords :
XML; buffer storage; functional programming; query processing; MFT; OCaml; XML transformation streaming; XPath; XQuery stream engines; XQuery streaming; buffer management; deforestation; functional programming; garbage collector; macro forest transducer; static analysis problems; strictness analysis;
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
DOI :
10.1109/ICDE.2014.6816714