• DocumentCode
    2846848
  • Title

    Representing and querying data transformations

  • Author

    Velegrakis, Yannis ; Miller, Renée J. ; Mylopoulos, John

  • fYear
    2005
  • fDate
    5-8 April 2005
  • Firstpage
    81
  • Lastpage
    92
  • Abstract
    Modern information systems often store data that has been transformed and integrated from a variety of sources. This integration may obscure the original source semantics of data items. For many tasks, it is important to be able to determine not only where data items originated, but also why they appear in the integration as they do and through what transformation they were derived. This problem is known as data provenance. In this work, we consider data provenance at the schema and mapping level. In particular, we consider how to answer questions such as "what schema elements in the source(s) contributed to this value?" or "through what transformations or mappings was this value derived?" To this end, we elevate schemas and mappings to first-class citizens that are stored in a repository and are associated with the actual data values. We also develop an extended query language, called MXQL, that allows meta-data to be queried as regular data, and we describe its implementation scenario.
  • Keywords
    data structures; meta data; query languages; query processing; data integration; data provenance; data representation; data transformation querying; extended query language; information system; meta-data; schema mapping; Database languages; Government; Information retrieval; Information systems
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.123
  • Filename
    1410108