Title :
XMiner: Mining XML Mediated Schemas
Author :
Nguyen, Hong-Quang ; Rahayu, Wenny ; Nguyen, Kinh ; Taniar, David
Author_Institution :
Dept of Comput. Sci. & Comput. Eng., La Trobe Univ., Melbourne, VIC
Abstract :
This paper presents a novel schema mediation approach, called XMiner, for mining mediated schemas from a set of XML schemas. XMiner addresses three main problems resulting from the heterogeneous source schemas: nesting discrepancy, backward paths and schema discrepancy. XMiner discovers frequent substructures using frequent subtree mining algorithms, and then constructs a mediated schemas. XMiner aims to preserve the hierarchical structure as the best as possible while avoiding information loss. XMiner exploits structural context, forward/backward paths, and label semantics for matching, mapping and merging frequent substructures. Experiments on real and synthetic datasets are reported to show that XMiner offers acceptable performance and quality for large-scale application scenarios.
Keywords :
XML; data mining; XML mediated schemas mining; XMiner; frequent subtree mining algorithms; label semantics; nesting discrepancy; schema discrepancy; schema mediation approach; Australia; Computer science; Information technology; Intelligent agent; Large-scale systems; Machine learning; Machine learning algorithms; Mediation; Motion pictures; XML; frequent subtree mining; mediated schema; schema integration; schema matching; schema mediation;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.301