Title :
Metric Functional Dependencies
Author :
Koudas, Nick ; Saha, Avishek ; Srivastava, Divesh ; Venkatasubramanian, Suresh
Author_Institution :
Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON
fDate :
March 29 2009-April 2 2009
Abstract :
When merging data from various sources, it is often the case that small variations in data format and interpretation cause traditional functional dependencies (FDs) to be violated, without there being an intrinsic violation of semantics. Examples include differing address formats, or different reported latitude/longitudes for a given address. In this paper, we define metric functional dependencies, which strictly generalize traditional FDs by allowing small differences (controlled by a metric) in values of the consequent attribute of an FD. We present efficient algorithms for the verification problem: determining whether a given metric FD holds for a given relation. We experimentally demonstrate the validity and efficiency of our approach on various data sets that lie in multidimensional spaces.
Keywords :
database management systems; intrinsic violation; metric functional dependencies; multidimensional spaces; verification problem; Cities and towns; Clocks; Computer science; Data engineering; Databases; Merging; Multidimensional systems; Process design; Robustness; USA Councils; functional dependencies; metric spaces; verification algorithms;
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
DOI :
10.1109/ICDE.2009.219