DocumentCode :
1426199
Title :
Semantic query optimization for query plans of heterogeneous multidatabase systems
Author :
Hsu, Chun-Nan ; Knoblock, Craig A.
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
Volume :
12
Issue :
6
fYear :
2000
Firstpage :
959
Lastpage :
978
Abstract :
New applications of information systems need to integrate a large number of heterogeneous databases over computer networks. Answering a query in these applications usually involves selecting relevant information sources and generating a query plan to combine the data automatically. As significant progress has been made in source selection and plan generation, the critical issue has been shifting to query optimization. This paper presents a semantic query optimization (SQO) approach to optimizing query plans of heterogeneous multidatabase systems. This approach provides global optimization for query plans as well as local optimization for subqueries that retrieve data from individual database sources. An important feature of our local optimization algorithm is that we prove necessary and sufficient conditions to eliminate an unnecessary join in a conjunctive query of arbitrary join topology. This feature allows our optimizer to utilize more expressive relational rules to provide a wider range of possible optimizations than previous work in SQO. The local optimization algorithm also features a new data structure called AND-OR implication graphs to facilitate the search for optimal queries. These features allow the global optimization to effectively use semantic knowledge to reduce the data transmission cost. We have implemented this approach in the PESTO (Plan Enhancement by SemanTic Optimization) query plan optimizer as a part of the SIMS information mediator. Experimental results demonstrate that PESTO can provide significant savings in query execution cost over query plan execution without optimization
Keywords :
data structures; distributed databases; query processing; relational databases; AND-OR implication graphs; PESTO; SIMS information mediator; computer networks; conjunctive query; data retrieval; data structure; data transmission cost; expressive relational rules; global optimization; heterogeneous multidatabase systems; information system applications; join topology; local optimization; necessary conditions; optimal query searching; query execution cost; query plans; relevant information source selection; semantic query optimization; subqueries; sufficient conditions; unnecessary join elimination; Application software; Computer networks; Cost function; Information retrieval; Information systems; Query processing; Relational databases; Spatial databases; Sufficient conditions; Topology;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.895804
Filename :
895804
Link To Document :
بازگشت