Title :
Entity identification in database integration
Author :
Lim, Ee-Peng ; Srivastava, Jaideep ; Prabhakar, Satya ; Richardson, James
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Abstract :
The objective of entity identification is to determine the correspondence between object instances from more than one database. Entity identification at the instance level, assuming that schema level heterogeneity has been resolved a priori, is examined. Soundness and completeness are defined as the desired properties of any entity identification technique. To achieve soundness, a set of identity and distinctness rules are established for entities in the integrated world. The use of extended key, which is the union of keys, and possibly other attributes, from the relations to be matched, and its corresponding identify rule are proposed to determine the equivalence between tuples from relations which may not share any common key. Instance level functional dependencies (ILFD), a form of semantic constraint information about the real-world entities, are used to derive the missing extended key attribute values of a tuple
Keywords :
database theory; distributed databases; ILFD; completeness; database integration; distinctness rules; entity identification; extended key; extended key attribute values; instance level; instance level functional dependencies; integrated world; object instances; real-world entities; schema level heterogeneity; semantic constraint information; tuple; Computer science; Context modeling; Contracts; Databases; Electric breakdown; Laboratories; Time factors;
Conference_Titel :
Data Engineering, 1993. Proceedings. Ninth International Conference on
Conference_Location :
Vienna
Print_ISBN :
0-8186-3570-3
DOI :
10.1109/ICDE.1993.344053