DocumentCode
3334324
Title
Interpreting similarity measures: Bridging the gap between schema matching and data integration
Author
Gal, Avigdor
Author_Institution
Technion-Israel Inst. of Technol., Haifa
fYear
2008
fDate
7-12 April 2008
Firstpage
278
Lastpage
285
Abstract
It has been recognized in the literature that the process of schema matching is uncertain. Such uncertainty at the core of data integration needs to be managed correctly to avoid dire consequences. Traditionally, manual intervention was required to make local decisions at the schema matching level to reach a deterministic matching before the rest of the data integration system can use it. Recently, however, researchers have argued for moving to fully-automatic transition of schema matching results into other data integration activities. In this work we discuss what it takes to bridge the gap between automatic schema matching and data integration. We briefly present the modeling of schema matching as an uncertain process, review a sufficient condition for using matcher similarity measure as a measure of schema matching correctness and provide a case study of data integration in peer database management system to demonstrate the benefit of our proposed gap bridging technique.
Keywords
distributed databases; peer-to-peer computing; automatic deterministic schema matching; data integration system; heterogeneous distributed data sources; matcher similarity measure; peer database management system; peer-to-peer system; Bipartite graph; Bridges; Companies; Database systems; Distributed databases; HTML; Peer to peer computing; Sufficient conditions; Uncertainty; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-2161-9
Electronic_ISBN
978-1-4244-2162-6
Type
conf
DOI
10.1109/ICDEW.2008.4498332
Filename
4498332
Link To Document