Title :
MDDQL-Stat: data querying and analysis through integration of intentional and extensional semantics
Author :
Kapetanios, Epaminondas ; Baer, David ; Glaus, Böjrn ; Groenewoud, Paul
Author_Institution :
Plirosoft Ltd., Semantic Technol., Zurich, Switzerland
Abstract :
We would like to present a prototype system enabling a rather empirical than a formal approach to the problem of posing queries to a semantically rich (quality aspects, semantic distance, etc.) data integration system {G,S,M} (Global schema, Sources, Mediation) through integration not only of intensional but also of extensional semantics. While the first is provided by an alphabet A as given by an ontology based global schema C, and a high level query language (conjunction/disjunction + inequalities + statistical operations), the latter enables synthesizing of data source specific and previously transformed query results according to well-defined set operations for heterogeneous, distributed data sources. Our approach contrasts with other GAV (Global-As-View) related architectures for mediation of integrated read-only views, in that it simplifies query processing while preserving flexibility when adding new data sources, despite the inherited complexity of mappings due to enhanced semantic description of data (semantic distance, quality parameters, etc.) such that statistical results and comparisons become more meaningful.
Keywords :
data handling; data structures; distributed databases; information resources; query processing; semantic Web; statistical databases; Global-As-View related architectures; MDDQL-Stat; data analysis; data integration system; data quality; data querying; data semantic description; data source synthesizing; distributed data sources; extensional semantics; heterogeneous data sources; high level query language; integrated read-only views; intentional semantics; mapping complexity; ontology based global schema; query processing; semantic distance; statistical operations; Cloning; Data analysis; Database languages; Mediation; Natural languages; Ontologies; Prototypes; Query processing; Statistics; Vocabulary;
Conference_Titel :
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-2146-0
DOI :
10.1109/SSDM.2004.1311230