• DocumentCode
    3334324
  • Title

    Interpreting similarity measures: Bridging the gap between schema matching and data integration

  • Author

    Gal, Avigdor

  • Author_Institution
    Technion-Israel Inst. of Technol., Haifa
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    278
  • Lastpage
    285
  • Abstract
    It has been recognized in the literature that the process of schema matching is uncertain. Such uncertainty at the core of data integration needs to be managed correctly to avoid dire consequences. Traditionally, manual intervention was required to make local decisions at the schema matching level to reach a deterministic matching before the rest of the data integration system could use it. Recently, however, researchers have argued for moving to a fully automatic transition of schema matching results into other data integration activities. In this work we discuss what it takes to bridge the gap between automatic schema matching and data integration. We briefly present the modeling of schema matching as an uncertain process, review a sufficient condition for using a matcher's similarity measure as a measure of schema matching correctness, and provide a case study of data integration in a peer database management system to demonstrate the benefit of our proposed gap-bridging technique.
  • Keywords
    distributed databases; peer-to-peer computing; automatic deterministic schema matching; data integration system; heterogeneous distributed data sources; matcher similarity measure; peer database management system; peer-to-peer system; Bipartite graph; Bridges; Companies; Database systems; Distributed databases; HTML; Peer to peer computing; Sufficient conditions; Uncertainty; XML;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-2161-9
  • Electronic_ISBN
    978-1-4244-2162-6
  • Type

    conf

  • DOI
    10.1109/ICDEW.2008.4498332
  • Filename
    4498332