DocumentCode :
170001
Title :
Towards Planning Scientific Experiments through Declarative Model Discovery in Provenance Data
Author :
Ferreira Silva, Mateus ; Araujo Baiao, Fernanda ; Revoredo, Kate
Author_Institution :
Dept. of Appl. Inf., Fed. Univ. of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Brazil
Volume :
2
fYear :
2014
fDate :
20-24 Oct. 2014
Firstpage :
95
Lastpage :
98
Abstract :
Data provenance is the process of managing a collection of metadata that catalogs the origin and history of data. In scientific workflows, this metadata assists scientists and domain specialists in several tasks, including the reproduction of scientific experiments and planning of new scenarios to be experimented. However, the amount of provenance data generated from scientific workflow executions can grow through time, becoming infeasible for scientists to manually evaluate them. Thus, mechanisms for automatically extracting knowledge from provenance data and presenting them to the user are demanding. Due to the diversity and flexibility inherent to scientific experimentation scenarios, declarative models are potentially adequate. In this work, we propose to apply techniques for learning a declarative model from provenance data generated by scientific workflows, from which the domain specialist will be able to plan future scenarios for his/her scientific experiment. The proposed solution is illustrated in a case study of a scientific experiment on ontology matching, which is a data-intensive strategy that is required to solve the problem of information integration in several areas of knowledge.
Keywords :
cataloguing; data integration; knowledge acquisition; learning (artificial intelligence); meta data; ontologies (artificial intelligence); scientific information systems; catalogs; data-intensive strategy; declarative model discovery; domain specialists; information integration; knowledge extraction; learning; metadata collection management; ontology matching; provenance data; scientific experimentation scenarios; scientific experiments planning; scientific experiments reproduction; scientific workflow executions; scientific workflows; scientists; Data mining; Data models; Measurement; Ontologies; Planning; Semantics; Unified modeling language; data provenance; declarative model; scientific experiment; scientific workflow;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
e-Science (e-Science), 2014 IEEE 10th International Conference on
Conference_Location :
Sao Paulo
Print_ISBN :
978-1-4799-4288-6
Type :
conf
DOI :
10.1109/eScience.2014.60
Filename :
6972106
Link To Document :
بازگشت