• DocumentCode
    3134510
  • Title
    MDDQL-Stat: data querying and analysis through integration of intentional and extensional semantics
  • Author
    Kapetanios, Epaminondas ; Baer, David ; Glaus, Björn ; Groenewoud, Paul
  • Author_Institution
    Plirosoft Ltd., Semantic Technol., Zurich, Switzerland
  • fYear
    2004
  • fDate
    21-23 June 2004
  • Firstpage
    353
  • Lastpage
    356
  • Abstract
    We present a prototype system that takes an empirical rather than a formal approach to the problem of posing queries to a semantically rich (quality aspects, semantic distance, etc.) data integration system {G,S,M} (Global schema, Sources, Mediation) by integrating not only intensional but also extensional semantics. The former is provided by an alphabet A, as given by an ontology-based global schema C, and a high-level query language (conjunction/disjunction + inequalities + statistical operations); the latter enables the synthesis of data-source-specific, previously transformed query results according to well-defined set operations for heterogeneous, distributed data sources. Our approach contrasts with other GAV (Global-As-View) related architectures for mediation of integrated read-only views in that it simplifies query processing while preserving flexibility when adding new data sources, despite the inherited complexity of mappings due to the enhanced semantic description of data (semantic distance, quality parameters, etc.), so that statistical results and comparisons become more meaningful.
  • Keywords
    data handling; data structures; distributed databases; information resources; query processing; semantic Web; statistical databases; Global-As-View related architectures; MDDQL-Stat; data analysis; data integration system; data quality; data querying; data semantic description; data source synthesizing; distributed data sources; extensional semantics; heterogeneous data sources; high level query language; integrated read-only views; intentional semantics; mapping complexity; ontology based global schema; query processing; semantic distance; statistical operations; Cloning; Data analysis; Database languages; Mediation; Natural languages; Ontologies; Prototypes; Query processing; Statistics; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-2146-0
  • Type
    conf
  • DOI
    10.1109/SSDM.2004.1311230
  • Filename
    1311230