• DocumentCode
    170001
  • Title

    Towards Planning Scientific Experiments through Declarative Model Discovery in Provenance Data

  • Author

    Ferreira Silva, Mateus ; Araujo Baiao, Fernanda ; Revoredo, Kate

  • Author_Institution
    Dept. of Appl. Inf., Fed. Univ. of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Brazil
  • Volume
    2
  • fYear
    2014
  • fDate
    20-24 Oct. 2014
  • Firstpage
    95
  • Lastpage
    98
  • Abstract
    Data provenance is the process of managing a collection of metadata that catalogs the origin and history of data. In scientific workflows, this metadata assists scientists and domain specialists in several tasks, including the reproduction of scientific experiments and planning of new scenarios to be experimented. However, the amount of provenance data generated from scientific workflow executions can grow through time, becoming infeasible for scientists to manually evaluate them. Thus, mechanisms for automatically extracting knowledge from provenance data and presenting them to the user are demanding. Due to the diversity and flexibility inherent to scientific experimentation scenarios, declarative models are potentially adequate. In this work, we propose to apply techniques for learning a declarative model from provenance data generated by scientific workflows, from which the domain specialist will be able to plan future scenarios for his/her scientific experiment. The proposed solution is illustrated in a case study of a scientific experiment on ontology matching, which is a data-intensive strategy that is required to solve the problem of information integration in several areas of knowledge.
  • Keywords
    cataloguing; data integration; knowledge acquisition; learning (artificial intelligence); meta data; ontologies (artificial intelligence); scientific information systems; catalogs; data-intensive strategy; declarative model discovery; domain specialists; information integration; knowledge extraction; learning; metadata collection management; ontology matching; provenance data; scientific experimentation scenarios; scientific experiments planning; scientific experiments reproduction; scientific workflow executions; scientific workflows; scientists; Data mining; Data models; Measurement; Ontologies; Planning; Semantics; Unified modeling language; data provenance; declarative model; scientific experiment; scientific workflow;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2014 IEEE 10th International Conference on
  • Conference_Location
    Sao Paulo
  • Print_ISBN
    978-1-4799-4288-6
  • Type

    conf

  • DOI
    10.1109/eScience.2014.60
  • Filename
    6972106