• DocumentCode
    170308
  • Title

    FragFlow Automated Fragment Detection in Scientific Workflows

  • Author

    Garijo, Daniel ; Corcho, Oscar ; Gil, Yolanda ; Gutman, Boris A. ; Dinov, Ivo D. ; Thompson, Paul ; Toga, Arthur W.

  • Author_Institution
    Dipt. Intel. Artificial, Univ. Politec. de Madrid, Madrid, Spain
  • Volume
    1
  • fYear
    2014
  • fDate
    20-24 Oct. 2014
  • Firstpage
    281
  • Lastpage
    289
  • Abstract
    Scientific workflows provide the means to define, execute and reproduce computational experiments. However, reusing existing workflows still poses challenges for workflow designers. Workflows are often too large and too specific to reuse in their entirety, so reuse is more likely to happen for fragments of workflows. These fragments may be identified manually by users as sub-workflows, or detected automatically. In this paper we present the FragFlow approach, which detects workflow fragments automatically by analyzing existing workflow corpora with graph mining algorithms. FragFlow detects the most common workflow fragments, links them to the original workflows and visualizes them. We evaluate our approach by comparing FragFlow results against user-defined sub-workflows from three different corpora of the LONI Pipeline system. Based on this evaluation, we discuss how automated workflow fragment detection could facilitate workflow reuse.
  • Keywords
    data mining; graph theory; pipeline processing; scientific information systems; FragFlow; LONI pipeline system; automated workflow fragment detection; graph mining algorithms; scientific workflows; user-defined subworkflows; workflow corpora; workflow visualization; Algorithm design and analysis; Data mining; Filtering; Filtering algorithms; Measurement; Neuroimaging; Pipelines; LONI pipeline; scientific workflow; workflow fragment; workflow reuse;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2014 IEEE 10th International Conference on
  • Conference_Location
    Sao Paulo
  • Print_ISBN
    978-1-4799-4288-6
  • Type

    conf

  • DOI
    10.1109/eScience.2014.32
  • Filename
    6972275