DocumentCode :
3334324
Title :
Interpreting similarity measures: Bridging the gap between schema matching and data integration
Author :
Gal, Avigdor
Author_Institution :
Technion-Israel Inst. of Technol., Haifa
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
278
Lastpage :
285
Abstract :
It has been recognized in the literature that the process of schema matching is uncertain. Such uncertainty at the core of data integration needs to be managed correctly to avoid dire consequences. Traditionally, manual intervention was required to make local decisions at the schema matching level to reach a deterministic matching before the rest of the data integration system can use it. Recently, however, researchers have argued for moving to fully-automatic transition of schema matching results into other data integration activities. In this work we discuss what it takes to bridge the gap between automatic schema matching and data integration. We briefly present the modeling of schema matching as an uncertain process, review a sufficient condition for using matcher similarity measure as a measure of schema matching correctness and provide a case study of data integration in peer database management system to demonstrate the benefit of our proposed gap bridging technique.
Keywords :
distributed databases; peer-to-peer computing; automatic deterministic schema matching; data integration system; heterogeneous distributed data sources; matcher similarity measure; peer database management system; peer-to-peer system; Bipartite graph; Bridges; Companies; Database systems; Distributed databases; HTML; Peer to peer computing; Sufficient conditions; Uncertainty; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2161-9
Electronic_ISBN :
978-1-4244-2162-6
Type :
conf
DOI :
10.1109/ICDEW.2008.4498332
Filename :
4498332
Link To Document :
بازگشت