• DocumentCode
    505970
  • Title
    Advanced data flow support for scientific grid workflow applications
  • Author
    Qin, Jun ; Fahringer, Thomas
  • Author_Institution
    University of Innsbruck, Innsbruck, Austria
  • fYear
    2007
  • fDate
    10-16 Nov. 2007
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    Existing work does not provide a flexible dataset-oriented data flow mechanism to meet the complex requirements of scientific Grid workflow applications. In this paper we present a sophisticated approach to this problem by introducing a data collection concept and the corresponding collection distribution constructs, which are inspired by HPF but applied to Grid workflow applications. Based on these constructs, more fine-grained data flows can be specified at an abstract workflow language level, such as mapping a portion of a dataset to an activity, or independently distributing multiple datasets, not necessarily with the same number of data elements, onto loop iterations. Our approach reduces data duplication, optimizes data transfers, and simplifies porting workflow applications onto the Grid. We have extended AGWL with these concepts and implemented the corresponding runtime support in ASKALON. We apply our approach to several real-world scientific workflow applications and report performance results.
  • Keywords
    Application software; Computer science; Control systems; Data engineering; Engineering management; Grid computing; Permission; Resource management; Runtime; Technology management; data collection; data distribution; data flow; grid workflow
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Supercomputing, 2007. SC '07. Proceedings of the 2007 ACM/IEEE Conference on
  • Conference_Location
    Reno, NV, USA
  • Print_ISBN
    978-1-59593-764-3
  • Electronic_ISBN
    978-1-59593-764-3
  • Type
    conf
  • DOI
    10.1145/1362622.1362679
  • Filename
    5348801