• DocumentCode
    2677504
  • Title
    Data integration by describing sources with constraint databases
  • Author
    Cheng, Xun ; Dong, Guozhu ; Lau, Tzekwan ; Su, Jianwen
  • Author_Institution
    Dept. of Comput. Sci., California Univ., Santa Barbara, CA, USA
  • fYear
    1999
  • fDate
    23-26 Mar 1999
  • Firstpage
    374
  • Lastpage
    381
  • Abstract
    We develop a data integration approach for the efficient evaluation of queries over autonomous source databases. The approach is based on some novel applications and extensions of constraint database techniques. We assume the existence of a global database schema. The contents of each data source are described using a set of constraint tuples over the global schema; each such tuple indicates possible contributions from the source. The “source description catalog” (SDC) of a global relation consists of its associated constraint tuples. Such a method of description is advantageous since it is flexible to add new sources and to modify existing ones. In our framework, to evaluate a conjunctive query over the global schema, a plan generator first identifies relevant data sources by “evaluating” the query against the SDCs using techniques of constraint query evaluation; it then formulates an evaluation plan, consisting of some specialized queries over different paths. The evaluation of a query associated with a path is done by a sequence of partial evaluations at data sources along the path, similar to sideways information passing of Datalog; the partially evaluated queries travel along their associated paths. Our SDC based query planning is efficient since it avoids the NP-complete query rewriting process. We can achieve further optimization using techniques such as emptiness test
  • Keywords
    constraint handling; deductive databases; query processing; Datalog; NP-complete query rewriting process; SDC based query planning; autonomous source databases; conjunctive query; constraint databases; constraint query evaluation; constraint tuples; data integration approach; data source; data sources; emptiness test; evaluation plan; global database schema; global relation; global schema; partial evaluations; partially evaluated queries; plan generator; sideways information passing; source description catalog; specialized queries; Application software; Computer science; Data engineering; Data warehouses; Databases; Process planning; Query processing; Software libraries; Testing; Warehousing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 1999. Proceedings., 15th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-0071-4
  • Type
    conf
  • DOI
    10.1109/ICDE.1999.754953
  • Filename
    754953