Title :
A Method to Mine Workflows from Provenance for Assisting Scientific Workflow Composition
Author :
Zeng, Reng ; He, Xudong ; van der Aalst, W.M.P.
Author_Institution :
Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA
Abstract :
Scientific workflows have recently emerged as a new paradigm for representing and managing complex distributed scientific computations and are used to accelerate the pace of scientific discovery. In many disciplines, individual workflows are large and complicated because of the large quantities of data involved. As a result, workflow construction is difficult or even impossible when relevant domain knowledge is missing or when the workflows require collaboration across multiple domains. Recent efforts by the scientific workflow community to capture provenance on a large scale present a new opportunity to use provenance to provide recommendations while scientific workflows are being built. This paper presents a provenance-based method for mining models of scientific workflows, including data and control dependencies. The mining result can either suggest parts of others' workflows for consideration or make familiar parts of a workflow easily accessible, thus providing recommendation support for scientific workflow composition.
Keywords :
data mining; scientific information systems; workflow management software; complex distributed scientific computation; provenance; relevant domain knowledge; scientific discovery; scientific workflow composition; workflow construction; workflow mining; Accuracy; Communities; Data mining; Data models; Educational institutions; Petri nets; Relational databases
Conference_Title :
Services (SERVICES), 2011 IEEE World Congress on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4577-0879-4
Electronic_ISBN :
978-0-7695-4461-8
DOI :
10.1109/SERVICES.2011.55