DocumentCode
2732553
Title
Efficiently Detecting Inclusion Dependencies
Author
Bauckmann, Jana ; Leser, Ulf ; Naumann, Felix ; Tietz, Veronique
Author_Institution
Dept. for Comput. Sci., Humboldt-Univ. zu Berlin
fYear
2007
fDate
15-20 April 2007
Firstpage
1448
Lastpage
1450
Abstract
Data sources for data integration often come with spurious schema definitions such as undefined foreign key constraints. Such metadata are important for querying the database and for database integration. We present our algorithm SPIDER (single pass inclusion dependency recognition) for detecting inclusion dependencies, as these are the automatically testable part of a foreign key constraint. For IND detection all pairs of attributes must be tested. SPIDER solves this task very efficiently by testing all attribute pairs in parallel. It analyzes a 2 GB database in ~ 20 min and a 21 GB database in ~ 4 h.
Keywords
data integrity; meta data; query processing; SPIDER; data integration; data sources; database integration; database querying; inclusion dependencies detection; metadata; single pass inclusion dependency recognition; undefined foreign key constraints; Automatic testing; Computer science; Data structures; Filters; Proteins; Relational databases; System recovery;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on
Conference_Location
Istanbul
Print_ISBN
1-4244-0802-4
Electronic_ISBN
1-4244-0803-2
Type
conf
DOI
10.1109/ICDE.2007.369032
Filename
4221822
Link To Document