• DocumentCode
    3088178
  • Title

    Integration of semistructured data with partial and inconsistent information

  • Author

    Liu, Mengchi ; Ling, Tok Wang ; Guan, Tao

  • Author_Institution
    Dept. of Comput. Sci., Regina Univ., Sask., Canada
  • fYear
    1999
  • fDate
    36373
  • Firstpage
    44
  • Lastpage
    52
  • Abstract
    Data integration from several sources has gained considerable attention with the recent popularity of the World Wide Web. In the real world, some information may be missing (i.e. partial) and some may be inconsistent from several sources. How to obtain information that is as complete as possible and how to detect inconsistency from these sources is thus an interesting question. Most existing work uses a simple graph-based or tree-based semistructured data model to represent heterogeneous data coming from various sites, which fails to account for the existence of partial and inconsistent information. In this paper, we redefine the notion of semistructured objects to reflect the existence of partial and inconsistent information and study how to integrate such objects spread over various sources and check their consistency in the meantime. We propose a new integration operator for this purpose and discuss its semantic properties
  • Keywords
    data integrity; data structures; database theory; distributed databases; information resources; World Wide Web; data integration; heterogeneous data; inconsistency detection; inconsistent information; information sources; integration operator; missing information; partial information; semantic properties; semistructured data; semistructured objects; Computer science; Data models; Database systems; Relational databases; Tree graphs; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-7695-0265-2
  • Type

    conf

  • DOI
    10.1109/IDEAS.1999.787250
  • Filename
    787250