Title :
Comparison of Complete and Elementless Native Storage of XML Documents
Author :
Harder, Theo ; Mathis, Christian ; Schmidt, Karsten
Author_Institution :
Univ. of Kaiserslautern, Kaiserslautern
Abstract :
Because XML documents tend to be very large, are accessed by declarative and navigational languages, and often are processed in a collaborative way using read/write transactions, their fine-grained storage and management in XML DBMSs is a must for which, in turn, a flexible and space-economic tree representation is mandatory. In this paper, we explore a variety of options to natively store, encode, and compress XML documents thereby preserving the full DBMS processing flexibility on the documents required by the various language models and usage characteristics. Important issues of our empirical study are related to node labeling, document container layout, indexing, as well as structure and content compression. Encoding and compression of XML documents with their complete structure leads to a space consumption of ~40% to ~60% compared to their plain representation, whereas structure virtualization (elementless storage) saves in the average more than 10%, in addition.
Keywords :
XML; tree data structures; DBMS; XML documents; declarative languages; fine-grained storage; native storage; navigational languages; space-economic tree representation; structure virtualization; Adaptive systems; Collaboration; Containers; Encoding; Extraterrestrial measurements; Indexing; Labeling; Navigation; Protein engineering; XML;
Conference_Titel :
Database Engineering and Applications Symposium, 2007. IDEAS 2007. 11th International
Conference_Location :
Banff, Alta.
Print_ISBN :
978-0-7695-2947-9
DOI :
10.1109/IDEAS.2007.4318094