Title :
Modeling Semantic Heterogeneity in Dataspace: A Machine Learning Approach
Author :
Singh, Mrityunjay ; Jain, S.K. ; Panchal, V.K.
Author_Institution :
Comput. Eng. Dept., Nat. Inst. of Technol., Kurukshetra, India
Abstract :
A data space system offers a new way to share and integrate information among distributed, autonomous, and heterogeneous data sources. To provide a best-effort answer to a user query, a data space system must resolve semantic heterogeneity at its core. Many solutions have been proposed to address this problem. We are exploring the problem of semantic heterogeneity in a data space system as part of our PhD work. In this paper, we address semantic heterogeneity in the context of a data space system and present an abstract framework for modeling it. The proposed model is based on machine learning and ontology approaches. The machine learning technique analyzes the semantically equivalent data items (or entities) in a data space, and the ontology conceptualizes the structural entities in a data space. This model resolves the semantic heterogeneity of a data space system and creates a conceptual model using a "from-data-to-schema" approach. The proposed approach implicitly creates the domain ontology by finding the most similar concepts coming from different data sources, and improves the performance of the system by finding the semantic relationships among them.
Keywords :
learning (artificial intelligence); ontologies (artificial intelligence); query processing; relational databases; abstract framework; data sources; dataspace system; distributed-autonomous heterogeneous data sources; domain ontology; from-data-to-schema approach; information integration; information sharing; machine learning approach; ontology approach; semantic heterogeneity modeling; semantic relationships; semantically equivalent data item analysis; structural entity conceptualization; user query answering; Data mining; Data models; Distributed databases; Ontologies; Prototypes; Semantics; Dataspace; Semantic Heterogeneity; User Feedback; from-data-to-schema; on-the-fly;
Conference_Title :
Information Technology (ICIT), 2014 International Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
978-1-4799-8083-3
DOI :
10.1109/ICIT.2014.24