Title :
Interpreting similarity measures: Bridging the gap between schema matching and data integration
Author_Institution :
Technion-Israel Inst. of Technol., Haifa
Abstract :
It has been recognized in the literature that the process of schema matching is uncertain. Such uncertainty at the core of data integration needs to be managed correctly to avoid dire consequences. Traditionally, manual intervention was required to make local decisions at the schema matching level to reach a deterministic matching before the rest of the data integration system can use it. Recently, however, researchers have argued for moving to fully-automatic transition of schema matching results into other data integration activities. In this work we discuss what it takes to bridge the gap between automatic schema matching and data integration. We briefly present the modeling of schema matching as an uncertain process, review a sufficient condition for using matcher similarity measure as a measure of schema matching correctness and provide a case study of data integration in peer database management system to demonstrate the benefit of our proposed gap bridging technique.
Keywords :
distributed databases; peer-to-peer computing; automatic deterministic schema matching; data integration system; heterogeneous distributed data sources; matcher similarity measure; peer database management system; peer-to-peer system; Bipartite graph; Bridges; Companies; Database systems; Distributed databases; HTML; Peer to peer computing; Sufficient conditions; Uncertainty; XML;
Conference_Titel :
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2161-9
Electronic_ISBN :
978-1-4244-2162-6
DOI :
10.1109/ICDEW.2008.4498332