Title :
Automating XML schema matching: A composite approach
Author :
Al-Ghanim, M. ; Noah, S.A. ; Sembok, T.M.
Author_Institution :
Dept. of Inf. Sci., Univ. Kebangsaan Malaysia, Darul Ehsan, Malaysia
Abstract :
With the rise of hyperlinked networks and the broad spectrum of information that is accessible over the Web, a lot of data sources become available. XML standard receives large acceptance, the comparison of the Web to a “database” is closer to reality than ever before. One of the long-standing and challenging problems for many applications, is schema matching. While there are many schema-matching algorithms proposed in the literature to tackle this problem, most of them are not tackling the indirect mappings, just the few ones do, but still there is a need to cover the special cases of XML schemas. In this paper, we propose a new composite approach that detects direct and indirect mappings for XML schemas with higher accuracy, by proposing algorithms for XML schema matching, after combining them, to detect most kinds of mappings. Our approach to automating XML Schema matching combines terminological mappings with the data value characteristics and the expected data values for the element-level matching, which combined with the XML structural mappings by applying a dedicated XML Match taxonomy. We also, develop an ontology that fits to the structure of XML schemas, in order to solve semantic conflicts.
Keywords :
XML; semantic Web; World Wide Web; XML schema matching algorithm; XML standard; XML structural mapping; broad spectrum; data value characteristics; dedicated XML match taxonomy; eXtensible Markup Language; element-level matching; hyperlinked networks; terminological mapping; Accuracy; Cities and towns; Dictionaries; Ontologies; Semantics; Taxonomy; XML; Schema matching; Semantic mapping; XML schema mapping; direct and indirect mapping;
Conference_Titel :
Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
Conference_Location :
Bandung
Print_ISBN :
978-1-4577-0753-7
DOI :
10.1109/ICEEI.2011.6021797