DocumentCode
984400
Title
Generating compact redundancy-free XML documents from conceptual-model hypergraphs
Author
Mok, Wai Yin ; Embley, David W.
Author_Institution
Dept. of Econ. & Inf. Syst., Alabama Univ., Huntsville, AL
Volume
18
Issue
8
fYear
2006
Firstpage
1082
Lastpage
1096
Abstract
As XML data becomes more and more prevalent and as larger quantities of data find their way into XML documents, the need for quality XML data organization only increase. One standard way of structuring data well is to reduce and, if possible, eliminate redundancy, while at the same time making the storage structures as compact as possible. In this paper, we present a methodology to generate XML storage structures where conforming XML documents are redundancy-free, and for most practical cases, are also fully compact. Our methodology assumes the input is a conceptual-model hypergraph. For the special case that every edge in the hypergraph is binary, we present a simple algorithm, guaranteed to always generate redundancy-free storage structures. We show, however, that generating a minimum number of redundancy-free storage structures is NP-hard. We therefore provide heuristics to guide the process and observe that these heuristics result in satisfactory solutions, which are often optimal. We then present a general algorithm for n-ary edges and show that it generates redundancy-free storage structures. The general algorithm must overcome several problems that do not arise in the special case
Keywords
XML; computational complexity; document handling; entity-relationship modelling; redundancy; storage management; NP-hard; XML storage structure; conceptual-model hypergraph; n-ary edge algorithm; redundancy-free XML document; Computer science; Feedback; Information systems; Proposals; Storage automation; XML; XML data redundancy; XML scheme generation.; compact XML storage structures;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2006.125
Filename
1644731
Link To Document