Title :
Dataflow Oriented Similarity Matching for Scientific Workflows
Author :
Yeo, Philip ; Abidi, Syed S. R.
Author_Institution :
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
Abstract :
Duplicate and redundant workflows can be avoided by encouraging workflow reuse. In this paper, we present how workflow similarity matching approach can be used to further enhance existing workflow modeling tools. Most existing workflow similarity algorithms cater for control-flow oriented types of workflow which are typically associated with business workflows. The increase presence of scientific workflows that are mainly dataflow oriented calls for workflow similarity matching that caters for these types of workflows instead. We demonstrate here how our work of applying a behavioral analysis technique (taking into consideration the causal footprint of the workflow) that has been used for finding similarity in business workflows perform when use for scientific workflows. The distinction of our technique is the use of data provenance within the scientific workflow model where positional information of the workflow activities are taken in consideration in order to find matching workflow models. Preliminary experiments have shown that our proposed solution provides a viable alternative for matching scientific workflows within multiple scenarios. Furthermore, our suggested approach performs better, particularly with the removal and extension types of modification to the original workflow.
Keywords :
business data processing; data flow computing; behavioral analysis technique; business workflows; control-flow oriented types; data provenance; dataflow oriented similarity matching approach; duplicate workflows; redundant workflows; scientific workflows; workflow modeling tools; workflow similarity matching approach; Business; Computational modeling; Finite element analysis; Generators; Indexes; Semantics; Vectors; causal footprint; dataflow oriented workflow; provenance; scientific workflow; workflow reuse; workflow similarity matching;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
DOI :
10.1109/IPDPSW.2013.69