DocumentCode :
2314904
Title :
Relational Operators in Heterogeneous Random Databases
Author :
Velcescu, Letitia ; Vasile, Laurentiu
Author_Institution :
Fac. of Math. & Inf., Univ. of Bucharest, Bucharest, Romania
fYear :
2009
fDate :
26-29 Sept. 2009
Firstpage :
407
Lastpage :
412
Abstract :
In this paper, we investigate the result sizes of some approximate relational operations, focusing on join, outer join and difference. We extend the notion of a random database, in which the records are random vectors following a certain probability distribution, to heterogeneous random databases, in which each column can have its own unidimensional distribution. In this framework, we investigate whether the results already established for homogeneous databases still hold. Our approach follows three steps. First, we build histograms for some relational operations on heterogeneous tables with specific distributions; then we apply the chi-square goodness-of-fit test; finally, we prove the result that sets the limits within which the cardinality of the self-join can be approximated by a Poisson distribution.
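The three steps named in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' procedure: the column distributions in `COLUMNS`, the table size, the number of trials and the binning are all hypothetical, and the Poisson mean is taken to be the expected number of matching row pairs, which the abstract does not specify.

```python
import math
import random
from collections import Counter

# Hypothetical per-column distributions (value -> probability); not from the paper.
COLUMNS = [
    {v: 1 / 50 for v in range(50)},  # column A: uniform over 50 values
    {0: 0.5, 1: 0.3, 2: 0.2},        # column B: skewed over 3 values
]

def match_probability(columns):
    """Probability that two independent records agree on every column."""
    return math.prod(sum(p * p for p in col.values()) for col in columns)

def random_table(n_rows, columns, rng):
    """Draw n_rows records; each column is sampled from its own distribution."""
    samples = [
        rng.choices(list(col), weights=list(col.values()), k=n_rows)
        for col in columns
    ]
    return list(zip(*samples))

def self_join_pairs(table):
    """Count unordered pairs of distinct rows that are equal on all columns."""
    return sum(c * (c - 1) // 2 for c in Counter(table).values())

def poisson_pmf(k, lam):
    return math.exp(-lam) * lam ** k / math.factorial(k)

def chi_square_vs_poisson(observations, lam, max_bin):
    """Pearson statistic with bins 0..max_bin-1 plus one pooled tail bin."""
    trials = len(observations)
    hist = Counter(min(x, max_bin) for x in observations)
    probs = [poisson_pmf(k, lam) for k in range(max_bin)]
    probs.append(1.0 - sum(probs))  # tail bin: values >= max_bin
    return sum(
        (hist.get(k, 0) - trials * p) ** 2 / (trials * p)
        for k, p in enumerate(probs)
    )

rng = random.Random(0)       # fixed seed so the run is repeatable
n_rows, trials = 20, 2000    # assumed sizes, chosen only for illustration

# Assumed Poisson mean: expected number of matching pairs among n_rows rows.
lam = math.comb(n_rows, 2) * match_probability(COLUMNS)

# Step 1: histogram data, one self-join pair count per simulated table.
sizes = [self_join_pairs(random_table(n_rows, COLUMNS, rng)) for _ in range(trials)]

# Step 2: chi-square goodness-of-fit statistic against Poisson(lam).
stat = chi_square_vs_poisson(sizes, lam, max_bin=4)
print(f"lambda={lam:.3f}  mean={sum(sizes) / trials:.3f}  chi2={stat:.2f}")
```

With five bins and a fully specified mean, the statistic would be compared against a chi-square critical value with 4 degrees of freedom. The third step in the abstract is a proof and is not reproduced here.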
Keywords :
Poisson distribution; distributed databases; relational databases; chi square test; goodness of fit; heterogeneous random databases; probability distribution; random vectors; relational operators; unidimensional distribution; Scientific computing; Random database; approximate relational operation; chi square test of goodness of fit; database optimization
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2009 11th International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4244-5910-0
Electronic_ISBN :
978-1-4244-5911-7
Type :
conf
DOI :
10.1109/SYNASC.2009.50
Filename :
5460821