DocumentCode :
787181
Title :
Optimizing queries with foreign functions in a distributed environment
Author :
Tsai, Pauray S M ; Chen, Arbee L P
Author_Institution :
Dept. of Inf. Manage., Ming Hsin Inst. of Technol., Hsinchu, Taiwan
Volume :
14
Issue :
4
fYear :
2002
Firstpage :
809
Lastpage :
824
Abstract :
Foreign functions have been considered in advanced database systems to support complex applications. We consider optimizing queries with foreign functions in a distributed environment. In traditional distributed query processing, selection operations are processed locally before joins as much as possible, so that the size of the relations being transmitted and joined is reduced. However, if selection predicates involve foreign functions, the cost of evaluating selections cannot be ignored. As a result, the execution order of selections and joins becomes significant, and the trade-off among the costs of data transmission, join processing, and selection predicate evaluation needs to be carefully considered in query optimization. A response time model is developed for estimating the cost of distributed query processing involving foreign functions. We explore the properties of the problem and find an optimal algorithm with polynomial complexity for a special case of it. However, finding the optimal execution plan for the general case is NP-hard. We propose an efficient heuristic algorithm for solving the problem, and the simulation results show that it is of good quality. The research results can also be applied to advanced database systems and to multidatabase systems, where the conversion functions defined for schema integration can be considered a type of foreign function.
Keywords :
computational complexity; database theory; distributed databases; optimisation; query processing; relational databases; NP-hard; data transmission; distributed database; distributed query processing; foreign functions; heuristic algorithm; joins; multidatabase systems; polynomial complexity; query optimization; relational database; response time model; schema integration; selection operations; selection predicate evaluation; Computational efficiency; Cost function; Data communication; Database systems; Delay; Heuristic algorithms; Object oriented modeling; Polynomials; Query processing; Relational databases;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2002.1019215
Filename :
1019215