DocumentCode :
2507164
Title :
Schema mediation in peer data management systems
Author :
Halevy, Alon Y. ; Ives, Zachary G. ; Suciu, Dan ; Tatarinov, Igor
Author_Institution :
Washington Univ., Seattle, WA, USA
fYear :
2003
fDate :
5-8 March 2003
Firstpage :
505
Lastpage :
516
Abstract :
Intuitively, data management and data integration tools should be well-suited for exchanging information in a semantically meaningful way. Unfortunately, they suffer from two significant problems: they typically require a comprehensive schema design before they can be used to store or share information, and they are difficult to extend because schema evolution is heavyweight and may break backwards compatibility. As a result, many small-scale data sharing tasks are more easily facilitated by nondatabase-oriented tools that have little support for semantics. The goal of the peer data management system (PDMS) is to address this need: we propose the use of a decentralized, easily extensible data management architecture in which any user can contribute new data, schema information, or even mappings between other peer´s schemas. PDMSs represent a natural step beyond data integration systems, replacing their single logical schema with an interlinked collection of semantic mappings between peer´s individual schemas. We consider the problem of schema mediation in a PDMS. Our first contribution is a flexible language for mediating between peer schemas, which extends known data integration formalisms to our more complex architecture. We precisely characterize the complexity of query answering for our language. Next, we describe a reformulation algorithm for our language that generalizes both global-as-view and local-as-view query answering algorithms. Finally, we describe several methods for optimizing the reformulation algorithm, and an initial set of experiments studying its performance.
Keywords :
data structures; distributed databases; query formulation; query languages; query processing; data integration tool; data management tool; peer data management system; query answering complexity; query optimisation; query reformulation; schema design; schema mediation; semantic information; semantic mapping; Cost function; Database languages; HTML; Investments; Mediation; Memory; Optimization methods; Standardization; Terminology; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2003. Proceedings. 19th International Conference on
Print_ISBN :
0-7803-7665-X
Type :
conf
DOI :
10.1109/ICDE.2003.1260817
Filename :
1260817
Link To Document :
بازگشت