DocumentCode
2057261
Title
Building regression cost models for multidatabase systems
Author
Zhu, Qiang ; Larson, Per-Åke
Author_Institution
Dept. of Comput. & Inf. Sci., Michigan Univ., Dearborn, MI, USA
fYear
1996
fDate
18-20 Dec 1996
Firstpage
220
Lastpage
231
Abstract
A major challenge for performing global query optimization in a multidatabase system (MDBS) is the lack of cost models for local database systems at the global level. The authors present a statistical procedure based on multiple regression analysis for building cost models for local database systems in an MDBS. Explanatory variables that can be included in a regression model are identified and a mixed forward and backward method for selecting significant explanatory variables is presented. Measures for developing useful regression cost models, such as removing outliers, eliminating multicollinearity, validating regression model assumptions, and checking significance of regression models, are discussed. Experimental results demonstrate that the presented statistical procedure can develop useful local cost models in an MDBS
Keywords
database theory; distributed databases; query processing; statistical analysis; explanatory variables; global query optimization; local database systems; mixed forward/backward method; multicollinearity elimination; multidatabase systems; multiple regression analysis; outlier removal; regression cost model building; regression model assumption validation; regression model significance checking; statistical procedure; Calibration; Cost function; Councils; Database systems; Laboratories; Optimization methods; Qualifications; Query processing; Sampling methods; Satellite broadcasting;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Information Systems, 1996., Fourth International Conference on
Conference_Location
Miami Beach, FL
Print_ISBN
0-8186-7475X
Type
conf
DOI
10.1109/PDIS.1996.568684
Filename
568684
Link To Document