Title :
Populating Data Warehouses with Semantic Data
Author :
Nebot, V. ; Berlanga, R.
Author_Institution :
Dept. de Lenguajes y Sist. Informaticos, Univ. Jaume I de Castellon, Castellon, Spain
fDate :
4/1/2010 12:00:00 AM
Abstract :
The Semantic Web has become a new environment that enables organizations to attach semantic annotations taken from domain and application ontologies to the information they generate. As a result, large amounts of complex, semi-structured and heterogeneus semantic data repositories are being made available. In this paper, we present an automatic method for on-demand construction of fact tables aimed at analyzing semantic data expressed as instance stores in RDF/OWL. The starting point of the method is a multidimensional star schema (i.e. topic of analysis, dimensions and measures) designed by the analyst by picking up concepts and properties from the ontology of the instance store. The method exploits the underlying semantics of ontology axioms in order to derive facts, that is, “valid combinations” of instances and literals that fulfill the analyst needs. Moreover, both the ontology axioms and the instances are indexed in order to reach scalability. We have tested the method with a synthetically generated instance set in order to show its scalability and effectiveness.
Keywords :
data warehouses; electronic data interchange; knowledge representation languages; meta data; ontologies (artificial intelligence); semantic Web; RDF/OWL; data warehouses; heterogeneus semantic data repositories; multidimensional star schema; ontologies; semantic Web; Data analysis; Data warehouses; Multidimensional systems; OWL; Ontologies; Resource description framework; Scalability; Semantic Web; Testing; Warehousing; Data warehousing; ETL processes; Semantic Web;
Journal_Title :
Latin America Transactions, IEEE (Revista IEEE America Latina)
DOI :
10.1109/TLA.2010.5514441