• DocumentCode
    2732553
  • Title

    Efficiently Detecting Inclusion Dependencies

  • Author

    Bauckmann, Jana ; Leser, Ulf ; Naumann, Felix ; Tietz, Veronique

  • Author_Institution
    Dept. for Comput. Sci., Humboldt-Univ. zu Berlin
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Firstpage
    1448
  • Lastpage
    1450
  • Abstract
    Data sources for data integration often come with spurious schema definitions such as undefined foreign key constraints. Such metadata are important for querying the database and for database integration. We present our algorithm SPIDER (single pass inclusion dependency recognition) for detecting inclusion dependencies, as these are the automatically testable part of a foreign key constraint. For IND detection all pairs of attributes must be tested. SPIDER solves this task very efficiently by testing all attribute pairs in parallel. It analyzes a 2 GB database in ~ 20 min and a 21 GB database in ~ 4 h.
  • Keywords
    data integrity; meta data; query processing; SPIDER; data integration; data sources; database integration; database querying; inclusion dependencies detection; metadata; single pass inclusion dependency recognition; undefined foreign key constraints; Automatic testing; Computer science; Data structures; Filters; Proteins; Relational databases; System recovery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on
  • Conference_Location
    Istanbul
  • Print_ISBN
    1-4244-0802-4
  • Electronic_ISBN
    1-4244-0803-2
  • Type

    conf

  • DOI
    10.1109/ICDE.2007.369032
  • Filename
    4221822